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Preface 



This book evolved from a set of lecture notes for a course on ‘Introduction to 
Mathematical Physics’, that I have given at California State University, Stanislaus 
(CSUS) for many years. Physics majors at CSUS take introductory mathematical 
physics before the physics core courses, so that they may acquire the expected 
level of mathematical competency for the core course. It is assumed that the 
student has an adequate preparation in general physics and a good understanding 
of the mathematical manipulations of calculus. For the student who is in need of a 
review of calculus, however, Appendix 1 and Appendix 2 are included. 

This book is not encyclopedic in character, nor does it give in a highly mathe- 
matical rigorous account. Our emphasis in the text is to provide an accessible 
working knowledge of some of the current important mathematical tools required 
in physics. 

The student will find that a generous amount of detail has been given mathe- 
matical manipulations, and that ‘it-may-be-shown-thats’ have been kept to a 
minimum. However, to ensure that the student does not lose sight of the develop- 
ment underway, some of the more lengthy and tedious algebraic manipulations 
have been omitted when possible. 

Each chapter contains a number of physics examples to illustrate the mathe- 
matical techniques just developed and to show their relevance to physics. They 
supplement or amplify the material in the text, and are arranged in the order in 
which the material is covered in the chapter. No effort has been made to trace the 
origins of the homework problems and examples in the book. A solution manual 
for instructors is available from the publishers upon adoption. 

Many individuals have been very helpful in the preparation of this text. I wish 
to thank my colleagues in the physics department at CSUS. 

Any suggestions for improvement of this text will be greatly appreciated. 

Turlock, California tai l. chow 

2000 
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Vector and tensor analysis 



Vectors and scalars 

Vector methods have become standard tools for the physicists. In this chapter we 
discuss the properties of the vectors and vector fields that occur in classical 
physics. We will do so in a way, and in a notation, that leads to the formation 
of abstract linear vector spaces in Chapter 5. 

A physical quantity that is completely specified, in appropriate units, by a single 
number (called its magnitude) such as volume, mass, and temperature is called a 
scalar. Scalar quantities are treated as ordinary real numbers. They obey all the 
regular rules of algebraic addition, subtraction, multiplication, division, and so 
on. 

There are also physical quantities which require a magnitude and a direction for 
their complete specification. These are called vectors if their combination with 
each other is commutative (that is the order of addition may be changed without 
affecting the result). Thus not all quantities possessing magnitude and direction 
are vectors. Angular displacement, for example, may be characterised by magni- 
tude and direction but is not a vector, for the addition of two or more angular 
displacements is not, in general, commutative (Fig. 1.1). 

In print, we shall denote vectors by boldface letters (such as A) and use ordin- 
ary italic letters (such as A) for their magnitudes; in writing, vectors are usually 
represented by a letter with an arrow above it such as A. A given vector A (or A) 
can be written as 



A = AA, (1.1) 

where A is the magnitude of vector A and so it has unit and dimension, and A is a 
dimensionless unit vector with a unity magnitude having the direction of A. Thus 
i = A/A. 
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Figure 1.1. Rotation of a parallelpiped about coordinate axes. 



A vector quantity may be represented graphically by an arrow-tipped line seg- 
ment. The length of the arrow represents the magnitude of the vector, and the 
direction of the arrow is that of the vector, as shown in Fig. 1 .2. Alternatively, a 
vector can be specified by its components (projections along the coordinate axes) 
and the unit vectors along the coordinate axes (Fig. 1.3): 

3 

X = A l e l + A 2 e 2 + Ae 3 = '^^A i e i , (1-2) 

i= 1 

where e t (i = 1, 2, 3) are unit vectors along the rectangular axes x t (vq = x, x 2 = y, 
x 3 = z); they are normally written as i. j, k in general physics textbooks. The 
component triplet (A l ,A 2 ,A 3 ) is also often used as an alternate designation for 
vector A: 



A — (A h A 2 , A 3 ). (1.2a) 

This algebraic notation of a vector can be extended (or generalized) to spaces of 
dimension greater than three, where an ordered n-tuple of real numbers, 
(Ai, A 2 , . . . , A n ), represents a vector. Even though we cannot construct physical 
vectors for n > 3, we can retain the geometrical language for these ^-dimensional 
generalizations. Such abstract “vectors” will be the subject of Chapter 5. 




Figure 1.2. Graphical representation of vector A. 
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DIRECTION ANGLES AND DIRECTION COSINES 



% 




Figure 1.3. A vector A in Cartesian coordinates. 



Direction angles and direction cosines 

We can express the unit vector A in terms of the unit coordinate vectors e ; . From 
Eq. (1.2), A = A x e x + A 2 e 2 + Ae 3 , we have 



A = A 




= AA. 



Now Ail A = cos a,A 2 / A = cos/3, and A 3 /A = COS 7 are the direction cosines of 
the vector A, and a, (3, and 7 are the direction angles (Fig. 1.4). Thus we can write 



A = A( cosed 5 ! +cos (3e 2 + cos 7 ^ 3 ) = v4^4; 



it follows that 

A = (cosed 5 ! +cos (3e 2 + cos 7 ^ 3 ) = (cos a, cos/3, cos 7 ). (1.3) 



*3 




Figure 1.4. Direction angles of vector A. 
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VECTOR AND TENSOR ANALYSIS 



Vector algebra 
Equality of vectors 

Two vectors, say A and B, are equal if, and only if, their respective components 
are equal: 

A = B or (A l ,A 2 ,A 3 ) = (Bi,B 2 ,B 3 ) 
is equivalent to the three equations 

A x = Bx,A 2 = B 2 ,A 3 = B 3 . 

Geometrically, equal vectors are parallel and have the same length, but do not 
necessarily have the same position. 



Vector addition 

The addition of two vectors is defined by the equation 

A + B = (Ax,A 2 , A 3 ) + (. Bx,B 2 , B 3 ) = ( A\ + By, A 2 + B 2 , A 3 + B 3 ). 

That is, the sum of two vectors is a vector whose components are sums of the 
components of the two given vectors. 

We can add two non-parallel vectors by graphical method as shown in Fig. 1.5. 
To add vector B to vector A, shift B parallel to itself until its tail is at the head of 
A. The vector sum A + B is a vector C drawn from the tail of A to the head of B. 
The order in which the vectors are added does not affect the result. 



Multiplication by a scalar 

If c is scalar then 

cA = (cA\, cA 2 , cA 3 ). 

Geometrically, the vector cA is parallel to A and is c times the length of A. When 
c = — 1 , the vector —A is one whose direction is the reverse of that of A, but both 




Figure 1.5. Addition of two vectors. 
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THE SCALAR PRODUCT 



have the same length. Thus, subtraction of vector B from vector A is equivalent to 
adding — B to A: 

A — B = A + (— B). 

We see that vector addition has the following properties: 

(a) A + B = B + A (commutativity); 

(b) (A + B) + C = A + (B + C) (associativity); 

(c) A + 0 = 0 + A = A; 

(d) A + (—A) = 0. 

We now turn to vector multiplication. Note that division by a vector is not 
defined: expressions such as k/A or B/A are meaningless. 

There are several ways of multiplying two vectors, each of which has a special 
meaning; two types are defined. 



The scalar product 

The scalar (dot or inner) product of two vectors A and B is a real number defined 
(in geometrical language) as the product of their magnitude and the cosine of the 
(smaller) angle between them (Figure 1.6): 

A B = AB cos 9 (0<6><7 t). (1.4) 

It is clear from the definition (1.4) that the scalar product is commutative: 

A B = B A, (1.5) 

and the product of a vector with itself gives the square of the dot product of the 
vector: 

A A = A 2 . (1.6) 

If A • B = 0 and neither A nor B is a null (zero) vector, then A is perpendicular to B. 




Figure 1.6. The scalar product of two vectors. 
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VECTOR AND TENSOR ANALYSIS 



We can get a simple geometric interpretation of the dot product from an 
inspection of Fig. 1.6: 

(B cos 9) A = projection of B onto A multiplied by the magnitude of A, 

(A cos 9)B = projection of A onto B multiplied by the magnitude of B. 



If only the components of A and B are known, then it would not be practical to 
calculate A • B from definition (1.4). But, in this case, we can calculate A • B in 
terms of the components: 

A B = (A l e l + A 2 e 2 + A 3 e 3 ) • {B l e l + B 2 e 2 + B 3 e 3 ); (1.7) 

the right hand side has nine terms, all involving the product e,- • ej . Fortunately, 
the angle between each pair of unit vectors is 90°, and from (1.4) and (1.6) we find 
that 



fa ■ • fa ■ = fa.. 

1 J U 5 



where is the Kronecker delta symbol 




i,j =1,2,3, 



if i t ^j, 
if i = j. 



( 1 . 8 ) 



(1.9) 



After we use (1.8) to simplify the resulting nine terms on the right-side of (7), we 
obtain 



3 

A B = A X B\ + A 2 B 2 + A 3 B 3 = A t B t . (1.10) 

1=1 

The law of cosines for plane triangles can be easily proved with the application 
of the scalar product: refer to Fig. 1.7, where C is the resultant vector of A and B. 
Taking the dot product of C with itself, we obtain 

c 2 = C • C = (A + B) • (A + B) 

= A 2 + B 2 + 2A ■ B = A 2 + B 1 + 2ABcos9 } 



which is the law of cosines. 




Figure 1.7. Law of cosines. 
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THE VECTOR (CROSS OR OUTER) PRODUCT 



A simple application of the scalar product in physics is the work W done by a 
constant force F: W = F • r, where r is the displacement vector of the object 
moved by F. 



The vector (cross or outer) product 



The vector product of two vectors A and B is a vector and is written as 

C = A x B. (1.11) 



As shown in Fig. 1.8, the two vectors A and B form two sides of a parallelogram. 
We define C to be perpendicular to the plane of this parallelogram with its 
magnitude equal to the area of the parallelogram. And we choose the direction 
of C along the thumb of the right hand when the fingers rotate from A to B (angle 
of rotation less than 180°). 

C = A x B = sin 9e c (0 < 9 < tt). (1.12) 



From the definition of the vector product and following the right hand rule, we 
can see immediately that 

A x B = — B x A. (1.13) 

Hence the vector product is not commutative. If A and B are parallel, then it 
follows from Eq. (1.12) that 

A x B = 0. (1.14) 



In particular 

Ax A = 0. 



(1.14a) 



In vector components, we have 

A x B = (H[C[ A.2^2 -P x^^) x (^ 1^1 + ^2^2 T~ ^3^3) • (1.15) 
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VECTOR AND TENSOR ANALYSIS 



Using the following relations 

O x e, = 0, i = 1,2,3, 

U x e 2 = e 3 , e 2 x e 3 = e u e 3 x r, = <? 2 , 



(1.16) 



Eq. (1.15) becomes 

A x B = ( A 2 B 3 — A 3 B 2 )ei + (A 3 B t — A\B 3 )e 2 + {A l B 2 — A 2 Bf)e 3 . (1.15a) 
This can be written as an easily remembered determinant of third order: 



<?i e 2 e 3 



A x B = 



Bi 



A 2 A 3 . 

b 2 b 3 



(1.17) 



The expansion of a determinant of third order can be obtained by diagonal multi- 
plication by repeating on the right the first two columns of the determinant and 
adding the signed products of the elements on the various diagonals in the result- 
ing array: 



a i 

b\ 

Cl 



«2 X /U, 

c? " c/ 



a x a 2 
b\ b 2 



- C ^ C \ 



The non-commutativity of the vector product of two vectors now appears as a 
consequence of the fact that interchanging two rows of a determinant changes its 
sign, and the vanishing of the vector product of two vectors in the same direction 
appears as a consequence of the fact that a determinant vanishes if one of its rows 
is a multiple of another. 

The determinant is a basic tool used in physics and engineering. The reader is 
assumed to be familiar with this subject. Those who are in need of review should 
read Appendix II. 

The vector resulting from the vector product of two vectors is called an axial 
vector, while ordinary vectors are sometimes called polar vectors. Thus, in Eq. 
(1.11), C is a pseudovector, while A and B are axial vectors. On an inversion of 
coordinates, polar vectors change sign but an axial vector does not change sign. 

A simple application of the vector product in physics is the torque r of a force F 
about a point 0\ t = F x r, where r is the vector from O to the initial point of the 
force F (Fig. 1.9). 

We can write the nine equations implied by Eq. (1.16) in terms of permutation 
symbols e ijk \ 

U X Cj ^ ijlA' k , 
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THE VECTOR (CROSS OR OUTER) PRODUCT 




Figure 1.9. The torque of a force about a point O. 
where £ ijk is defined by 

{ +1 if ( i,j,k ) is an even permutation of (1,2,3), 

— 1 if ( i,j,k ) is an odd permutation of (1,2,3), (1.18) 

0 otherwise (for example, if 2 or more indices are equal). 

It follows immediately that 

-ijk ^kij ^jki ~jik ~A/7 -/A.;/ ■ 

There is a very useful identity relating the £ ijk and the Kronecker delta symbol: 

3 

^ ~ mnk^ ijk ^mi^nj ^mj^nh (1-19) 

k= 1 



^ ^ -mik^njk 2(5 mm ^ . ! - (/A 6. (1.19a) 

j,k ij,k 

Using permutation symbols, we can now write the vector product A x B as 

A x B ( y 1 c . j x | y ' BjCj J x c I ) y ) (AjBjSjjjy) 

V '=1 / \j= 1 / ij ij,k 

Thus the /cth component of A x B is 

(A x B), y ) AjBj£jj k ^ ^ E-kijAjBj. 
ij ij 

If k = 1, we obtain the usual geometrical result: 

(A x B)j = Y^A.B, = s l23 A 2 B 3 + e l32 A 3 B 2 = A 2 B 3 — A 3 B 2 . 
ij 
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The triple scalar product A • (B x C) 

We now briefly discuss the scalar A • (B x C). This scalar represents the volume of 
the parallelepiped formed by the coterminous sides A, B, C, since 

A • (B x C) = ABC sin 6 cos a = hS = volume, 

S being the area of the parallelogram with sides B and C, and h the height of the 
parallelogram (Fig. 1.10). 

Now 



h e 2 e 3 



A • (B x C) 



(A|gi + A 2 e 2 + A 3 e 3 ) • 



S, 

Ci 



b 2 b 3 

C 2 c 3 



— A[(B 2 C 2 — S 3 C 2 ) + A 2 (B 3 C l — B\C 3 ) + A 3 (BjC 2 — B 2 C t ) 



so that 



A ■ (B x C) = 



A 1 A 2 A 3 
B\ B 2 B 3 

c, c 2 c 3 



( 1 . 20 ) 



The exchange of two rows (or two columns) changes the sign of the determinant 
but does not change its absolute value. Using this property, we find 



A • (B x C) 



^1 


^2 


^3 




Ci 


C 2 


c 3 


Bx 


b 2 


s 3 


= - 


Si 


S 2 


s 3 


Ci 


c 2 


c 3 




^1 


^2 


^3 



that is, the dot and the cross may be interchanged in the triple scalar product. 

A- (B x C) = (A x B) • C (1.21) 
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In fact, as long as the three vectors appear in cyclic order, A — > B — > C — > A, then 
the dot and cross may be inserted between any pairs: 

A • (B x C) = B • (C x A) = C • (A x B). 

It should be noted that the scalar resulting from the triple scalar product changes 
sign on an inversion of coordinates. For this reason, the triple scalar product is 
sometimes called a pseudoscalar. 



The triple vector product 

The triple product A x (B x C) is a vector, since it is the vector product of two 
vectors: A and B x C. This vector is perpendicular to B x C and so it lies in the 
plane of B and C. If B is not parallel to C, A x (B x C) = xB + yC. Now dot both 
sides with A and we obtain x(A • B) + y(A • C) = 0, since A ■ [A x (B x C)] = 0. 
Thus 

x/( A ■ C) = — y/( A • B) = A (A is a scalar) 

and so 

A x (B x C) = xB + yC = A[B(A ■ C) - C(A • B)]. 

We now show that A = 1. To do this, let us consider the special case when B = A. 
Dot the last equation with C: 

C x [A x (A x C)] = A[(A • C) 2 - A 2 C 2 ], 

or, by an interchange of dot and cross 

-(A-C) 2 = A[(A-C) 2 - a 2 c 2 ]. 

In terms of the angles between the vectors and their magnitudes the last equation 
becomes 

—A 2 C 2 sin 2 9 = A (A 2 C 2 cos 2 9 - A 2 C 2 ) = ~XA 2 C 2 sin 2 9; 
hence A = 1. And so 

A x (B x C) = B(A- C) -C(A-B). (1.22) 



Change of coordinate system 

Vector equations are independent of the coordinate system we happen to use. But 
the components of a vector quantity are different in different coordinate systems. 
We now make a brief study of how to represent a vector in different coordinate 
systems. As the rectangular Cartesian coordinate system is the basic type of 
coordinate system, we shall limit our discussion to it. Other coordinate systems 
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will be introduced later. Consider the vector A expressed in terms of the unit 
coordinate vectors (e\,e 2 ,e 3 ): 



3 

A = A\e i + A 2 e 2 + Ae 3 = A/ij. 

i= 1 

Relative to a new system (e[,e 2 ,e 3 ) that has a different orientation from that of 
the old system (ei,e 2 ,e 3 ), vector A is expressed as 

3 

A = A[e[ + A 2 e 2 + A' e 3 = A,'e,' . 

i= 1 

Note that the dot product A • e[ is equal to A[, the projection of A on the direction 
of e\ ; A • e 2 is equal to A 2 , and A • ej is equal to A 3 . Thus we may write 

A[ = (e\ ■ e[) A\ + ( e 2 ■ e[)A 2 + (e 3 • e[)A 3 , 'l 

A 2 = (ei • e 2 )A 1 + (, e 2 ■ e 2 )A 2 + (e 3 • e 2 )A 3l > (1-23) 

A 3 = (e 1 • e 3 )Ai + ( e 2 ■ e 3 )A 2 + (e 3 • e 3 )A 3 . ) 

The dot products (e i ■ e') are the direction cosines of the axes of the new coordi- 
nate system relative to the old system: e\ • £j = cos (x/, Xj); they are often called the 
coefficients of transformation. In matrix notation, we can write the above system 
of equations as 

/ A i \ fe \-e[ e 2 ■ e[ e 3 -e[ \ ( A { \ 

\A 2 = • e 2 e 2 -e 2 e 3 ■ e 2 \\ A 2 . 

\A 3 ) e 2 ■ e 3 e 3 -e 3 ) \A 3 J 

The 3x3 matrix in the above equation is called the rotation (or transformation) 
matrix, and is an orthogonal matrix. One advantage of using a matrix is that 
successive transformations can be handled easily by means of matrix multiplica- 
tion. Let us digress for a quick review of some basic matrix algebra. A full account 
of matrix method is given in Chapter 3. 

A matrix is an ordered array of scalars that obeys prescribed rules of addition 
and multiplication. A particular matrix element is specified by its row number 
followed by its column number. Thus cijj is the matrix element in the z'th row and 
/th column. Alternative ways of representing matrix A are [a,-,-] or the entire array 





( a n 


a l2 


«1 n \ 


A = 


a 2 \ 


a 22 


&2n 




y a ml 


a m2 • 


■ • ^mn J 
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A is an n x m matrix. A vector is represented in matrix form by writing its 
components as either a row or column array, such as 



B — {b\\ b\ 2 h 13 ) 



or 




where b n = b x ,b n = b y ,b n = b z , and c n = c x ,c 2 \ = _c v ,c 31 = c z . 

The multiplication of a matrix A and a matrix B is defined only when the 
number of columns of A is equal to the number of rows of B, and is performed 
in the same way as the multiplication of two determinants: if C = AB , then 

c ij = ^ a ikbkl- 
k 

We illustrate the multiplication rule for the case of the 3x3 matrix A multiplied 
by the 3x3 matrix B: 




If we denote the direction cosines e[ ■ e ; - by Xy, then Eq. (1.23) can be written as 

3 3 

A' l =Y,^- S i A i = Y. X ‘J A r (1.23a) 

7=1 7=1 

It can be shown (Problem 1.9) that the quantities Xy satisfy the following relations 

3 

X>7 A * = $k (j,k= 1,2,3). (1.24) 

1=1 

Any linear transformation, such as Eq. (1.23a), that has the properties required by 
Eq. (1.24) is called an orthogonal transformation, and Eq. (1.24) is known as the 
orthogonal condition. 



The linear vector space V n 

We have found that it is very convenient to use vector components, in particular, 
the unit coordinate vectors e t (i = 1, 2, 3). The three unit vectors e, are orthogonal 
and normal, or, as we shall say, orthonormal. This orthonormal property 
is conveniently written as Eq. (1.8). But there is nothing special about these 
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orthonormal unit vectors e,-. If we refer the components of the vectors to a 
different system of rectangular coordinates, we need to introduce another set of 
three orthonormal unit vectors /j,/ 2 , and f 3 . 

fifj ty (i,j =1,2,3). (1.8a) 

For any vector A we now write 

3 

A = 5>/„ and c i=fr A - 

i= 1 

We see that we can define a large number of different coordinate systems. But 
the physically significant quantities are the vectors themselves and certain func- 
tions of these, which are independent of the coordinate system used. The ortho- 
normal condition (1.8) or (1.8a) is convenient in practice. If we also admit oblique 
Cartesian coordinates then the f need neither be normal nor orthogonal; they 
could be any three non-coplanar vectors, and any vector A can still be written as a 
linear superposition of the f t 



A — c \f\ + c ifi + c iA- (1-25) 

Starting with the vectors f, we can find linear combinations of them by the 
algebraic operations of vector addition and multiplication of vectors by scalars, 
and then the collection of all such vectors makes up the three-dimensional linear 
space often called V 3 (V for vector) or R 3 ( R for real) or E 3 ( E for Euclidean). The 
vectors f\ , / 2 , / 3 are called the base vectors or bases of the vector space V 3 . Any set 
of vectors, such as the f h which can serve as the bases or base vectors of V 3 is 
called complete, and we say it spans the linear vector space. The base vectors are 
also linearly independent because no relation of the form 



c i/i + c 2fi + c if?, — 0 



(1.26) 



exists between them, unless c\ = c 2 = c 3 = 0. 

The notion of a vector space is much more general than the real vector space 
V 3 . Extending the concept of V 3 , it is convenient to call an ordered set of n 
matrices, or functions, or operators, a ‘vector’ (or an «-vector) in the n-dimen- 
sional space V„. Chapter 5 will provide justification for doing this. Taking a cue 
from V 3 , vector addition in V„ is defined to be 

(*i , • • • ,x n ) + (ji , . . . , y„) = (x, +y i , . . . , x n + v„) (1.27) 

and multiplication by scalars is defined by 

a(x u ...,x n ) = (ax u ...,ax n ), (1.28) 
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where a is real. With these two algebraic operations of vector addition and multi- 
plication by scalars, we call V n a vector space. In addition to this algebraic 
structure, V„ has geometric structure derived from the length defined to be 




(1.29) 



The dot product of two «-vectors can be defined by 

n 

(xi , ■ ■ • , x„) • (t’l , . . . , y n ) = ^2 x jyj ■ ( 1 - 30 ) 

7=1 



In V„, vectors are not directed line segments as in F 3 ; they may be an ordered set 
of n operators, matrices, or functions. We do not want to become sidetracked 
from our main goal of this chapter, so we end our discussion of vector space here. 



Vector differentiation 

Up to this point we have been concerned mainly with vector algebra. A vector 
may be a function of one or more scalars and vectors. We have encountered, for 
example, many important vectors in mechanics that are functions of time and 
position variables. We now turn to the study of the calculus of vectors. 

Physicists like the concept of field and use it to represent a physical quantity 
that is a function of position in a given region. Temperature is a scalar field, 
because its value depends upon location: to each point (x, y, z) is associated a 
temperature T(x,y,z). The function T(x,y,z) is a scalar field, whose value is a 
real number depending only on the point in space but not on the particular choice 
of the coordinate system. A vector field, on the other hand, associates with each 
point a vector (that is, we associate three numbers at each point), such as the wind 
velocity or the strength of the electric or magnetic field. When described in a 
rotated system, for example, the three components of the vector associated with 
one and the same point will change in numerical value. Physically and geo- 
metrically important concepts in connection with scalar and vector fields are 
the gradient, divergence, curl, and the corresponding integral theorems. 

The basic concepts of calculus, such as continuity and differentiability, can be 
naturally extended to vector calculus. Consider a vector A, whose components are 
functions of a single variable u. If the vector A represents position or velocity, for 
example, then the parameter u is usually time t, but it can be any quantity that 
determines the components of A. If we introduce a Cartesian coordinate system, 
the vector function A (u) may be written as 

A(i/) — A[(l/)cj-t-A2(//)C2“t“A3(i/)C3. (1.31) 
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A(m) is said to be continuous at u = m 0 if it is defined in some neighborhood of 
« 0 and 



lim A(u) = A(u 0 ). (1-32) 



Note that A (u) is continuous at m 0 if and only if its three components are con- 
tinuous at u 0 . 

A (u) is said to be differentiable at a point u if the limit 

dA(u) _ A(u+ Au) - A (u) 
du a«— >o Am 

exists. The vector A'(u) = dA(u)/du is called the derivative of A(m); and to differ- 
entiate a vector function we differentiate each component separately: 

A (m) — A[(m)C] 4~ A 2 (u )(?2 T (1.33a) 

Note that the unit coordinate vectors are fixed in space. Higher derivatives of A («) 
can be similarly defined. 

If A is a vector depending on more than one scalar variable, say u, v for 
example, we write A = A (u,v). Then 

dA = (dA/du)du+ (dA/dv)dv (1-34) 

is the differential of A, and 



dA A(m + Am, v) — A(u, v) 

IT = lim — - 

au Aw— »o ou 



(1.34a) 



and similarly for dA/dv. 

Derivatives of products obey rules similar to those for scalar functions. 
However, when cross products are involved the order may be important. 



Space curves 

As an application of vector differentiation, let us consider some basic facts about 
curves in space. If A(m) is the position vector r(u) joining the origin of a coordinate 
system and any point P(x l ,x 2 ,x 3 ) in space as shown in Fig. 1.11, then Eq. (1.31) 
becomes 

r(u) = X\{u)e\ + x 2 {u)e 2 + x 3 (u)e 3 . (1-35) 

As u changes, the terminal point P of r describes a curve C in space. Eq. (1.35) is 
called a parametric representation of the curve C, and u is the parameter of this 
representation. Then 

Ar / r (m + Am) — r(u)\ 

Am \ Am / 
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z 




Figure 1.11. Parametric representation of a curve. 



is a vector in the direction of Ar, and its limit (if it exists) dr/du is a vector in the 
direction of the tangent to the curve at (x\ , x 2 , x 3 ). If u is the arc length 5 measured 
from some fixed point on the curve C, then dr/ds = T is a unit tangent vector to 
the curve C. The rate at which T changes with respect to ^ is a measure of the 
curvature of C and is given by d T Ids. The direction of dT Ids at any given point on 
C is normal to the curve at that point: T ■ T = 1, d(T ■ T)/ds = 0, from this we 
get T ■ dT /ds = 0, so they are normal to each other. If TV is a unit vector in this 
normal direction (called the principal normal to the curve), then dT/ds— kN, 
and k is called the curvature of C at the specified point. The quantity p = l/n is 
called the radius of curvature. In physics, we often study the motion of particles 
along curves, so the above results may be of value. 

In mechanics, the parameter u is time t, then dr/dt = v is the velocity 
of the particle which is tangent to the curve at the specific point. Now we 
can write 



dr dr ds - 
v = -j- = -r ~r — vT 
dt ds dt 

where v is the magnitude of v, called the speed. Similarly, a = d\/dt is the accel- 
eration of the particle. 



Motion in a plane 

Consider a particle P moving in a plane along a curve C (Fig. 1.12). Now r = re r , 
where e r is a unit vector in the direction of r. Flence 

dr dr „ de r 

v = — = —e r + r—. 
dt dt dt 
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Now de r /dt is perpendicular to Also \de r /dt\ = dO/dt; we can easily verify this 
by differentiating e r = cos Qe x + sin 0e 2 - Hence 



dr dr „ 
v = — = — e,. 
dt dt 



dO „ 

+ r * v ’ 



e g is a unit vector perpendicular to e r . 
Dilferentiating again we obtain 



d\ 

a = — = 



d 2 r 



dt dt 2 



drde r drdO „ d 9 „ dd „ 

Jt^i + Jtti ee + r W ee + r dt ee 



Thus 



d 2 r „ 
= d? Cr 



dr dO , 
(ft (ft 



d 2 d ^ 

r W ee ~ r 




2 

e r 



de g dd , \ 

dt dt ' ) 




1 d 

r dt 




eg- 



A vector treatment of classical orbit theory 

To illustrate the power and use of vector methods, we now employ them to work 
out the Keplerian orbits. We first prove Kepler’s second law which can be stated 
as: angular momentum is constant in a central force field. A central force is a force 
whose line of action passes through a single point or center and whose magnitude 
depends only on the distance from the center. Gravity and electrostatic forces are 
central forces. A general discussion on central force can be found in, for example, 
Chapter 6 of Classical Mechanics , Tai L. Chow, John Wiley, New York, 1995. 

Dilferentiating the angular momentum L = r x p with respect to time, we 
obtain 

dL/c/t = dr/dt x p + r x dp/dt. 
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The first vector product vanishes because p = mdr/dt so dr/dt and p are parallel. 
The second vector product is simply r x F by Newton’s second law, and hence 
vanishes for all forces directed along the position vector r, that is, for all central 
forces. Thus the angular momentum L is a constant vector in central force 
motion. This implies that the position vector r, and therefore the entire orbit, 
lies in a fixed plane in three-dimensional space. This result is essentially Kepler’s 
second law, which is often stated in terms of the conservation of area velocity, 
|L|/2 m. 

We now consider the inverse-square central force of gravitational and electro- 
statics. Newton’s second law then gives 

mdy/dt=—(k/r 2 )n, (1-36) 

where n = r/r is a unit vector in the r-direction, and k = Gm i m 2 for the gravita- 
tional force, and k = q x q 2 for the electrostatic force in cgs units. First we note that 

v = dr/dt = dr/dth + rdh/dt. 

Then L becomes 

L = r x (tm) = mr 2 [h x (dh/dt)}. (1-37) 

Now consider 

d d\ k k 2 

— (v x L) = — xL= =■ (n x L) = =■ n x mr (n x dn dt)\ 

dt dt mr mr 

= — k[h(dh/dt ■ n) — ( dn/dt)(h • «)]. 



Since h • h = 1, it follows by differentiation that ft ■ dn/dt = 0. Thus we obtain 

— (v x L) = kdh/dt\ 



integration gives 



v x L = kh + C, 



(1.38) 



where C is a constant vector. It lies along, and fixes the position of, the major axis 
of the orbit as we shall see after we complete the derivation of the orbit. To find 
the orbit, we form the scalar quantity 

L 2 = L • (r x my) = mr ■ (v x L) = mr(k + Ceos 9), (1-39) 



where 6 is the angle measured from C (which we may take to be the v-axis) to r. 
Solving for r, we obtain 



L 2 /km A 

1 + C/(k cos 9) 1+ecosiF 



(1.40) 



Eq. (1.40) is a conic section with one focus at the origin, where e represents the 
eccentricity of the conic section; depending on its values, the conic section may be 
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a circle, an ellipse, a parabola, or a hyperbola. The eccentricity can be easily 
determined in terms of the constants of motion: 

e = ^ = ^|(v x L) — kh\ 

= j [|v x L| 2 + k 1 — 2 kh • (v x L)] '^ 2 

Now |v x L| 2 = v 2 Lr because v is perpendicular to L. Using Eq. (1.39), we obtain 



1 


2 r 2 1 2 'ZkL 


1/2 


", 2 L 2 f\ 2 k\ 


1/2 


2 L 2 E 


£ ~k 


v z L z + k z 

mr 


= 




= 


mk 2 



where E is the constant energy of the system. 



Vector differentiation of a scalar field and the gradient 



Given a scalar field in a certain region of space given by a scalar function 
<j>{x\,x 2 ,x 3 ) that is defined and differentiable at each point with respect to the 
position coordinates (x 3 ,x 2 ,x 3 ), the total differential corresponding to an infini- 
tesimal change dr = (dx \ , dx 2 , dx 3 ) is 



, , dcj) dcj) drj> 

dtp = — — dx i + — — dx 2 + — dx 3 . 

OX\ 0X2 VX3 

We can express d<j> as a scalar product of two vectors: 



d(j> 



d (j) 



d(j> = — — dx ] + — — dx 2 + — — dx 3 = (V0) • dr, 

OX 1 OX 2 0X3 



(1.41) 



(1.42) 



where 



dcj) „ d(j) „ d(j> , 

V< ^ = e i +-x—e 2 +—e 3 
OX 1 OX 2 oX^ 



(1.43) 



is a vector field (or a vector point function). By this we mean to each point 
r = (xi,x 2 ,x 3 ) in space we associate a vector as specified by its three compo- 
nents (d(j>/dx\,d(j)/dx 2 , d(j)/dx 3 ): V<f> is called the gradient of <j> and is often written 
as grad (j>. 

There is a simple geometric interpretation of V<j>. Note that <j>(x i,x 2 ,x 3 ) = c, 
where c is a constant, represents a surface. Let r = x\e\ + x 2 e 2 + x 3 e 3 be the 
position vector to a point P(x i,x 2 ,x 3 ) on the surface. If we move along the 
surface to a nearby point Q ( r + dr), then dr = dx x e \ + dx 2 e 2 + dx 3 e 3 lies in the 
tangent plane to the surface at P. But as long as we move along the surface (j) has a 
constant value and d<j) = 0. Consequently from (1.41), 

dr ■ V0 = 0. (1.44) 
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Figure 1.13. Gradient of a scalar. 



Eq. (1.44) states that V0 is perpendicular to hr and therefore to the surface (Fig. 
1.13). Let us return to 

d(j) = (V0) • dr. 

The vector V0 is fixed at any point P , so that dcj), the change in (j>, will depend to a 
great extent on hr. Consequently d<fi will be a maximum when dr is parallel to V0, 
since hr • V0 = |hr||V0| cos 9, and cos 0 is a maximum for 0 = 0. Thus is in 
the direction of maximum increase of cj>(x i,x 2 ,x 3 ). The component of in the 
direction of a unit vector u is given by V</> • u and is called the directional deri- 
vative of (f) in the direction u. Physically, this is the rate of change of </> at 
(X| , x 2 . x 3 ) in the direction u. 



Conservative vector field 

By definition, a vector field is said to be conservative if the line integral of the 
vector along any closed path vanishes. Thus, if F is a conservative vector field 
(say, a conservative force field in mechanics), then 

j) F • hs = 0, (1-45) 

where hs is an element of the path. A (necessary and sufficient) condition for F 
to be conservative is that F can be expressed as the gradient of a scalar, say 
</>: F = —grad ft: 

rb nb nb 

/ F • hs = — / grad • hs = - / dcf) = <j>(a) - 4>{b)'. 

J a J a J a 

it is obvious that the line integral depends solely on the value of the scalar </> at the 
initial and final points, and j>F ■ ds = — j> grad 4> • hs = 0. 
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The vector differential operator V 



We denoted the operation that changes a scalar field to a vector field in Eq. (1.43) 
by the symbol V (del or nabla): 



V = 



d „ d „ d „ 

+ a — + a~ e 3> 

OX [ OX 2 CUC3 



(1.46) 



which is called a gradient operator. We often write V0 as grad </>, and the vector 
field V^>(r) is called the gradient of the scalar field 0(r). Notice that the operator 
V contains both partial differential operators and a direction: it is a vector differ- 
ential operator. This important operator possesses properties analogous to those 
of ordinary vectors. It will help us in the future to keep in mind that V acts both 
as a differential operator and as a vector. 



Vector differentiation of a vector field 

Vector differential operations on vector fields are more complicated because of the 
vector nature of both the operator and the field on which it operates. As we know 
there are two types of products involving two vectors, namely the scalar and 
vector products; vector differential operations on vector fields can also be sepa- 
rated into two types called the curl and the divergence. 



The divergence of a vector 

If V(.\'i,x 2 , v 3 ) = Vfi + V 2 e 2 + V 3 e 2 is a differentiable vector field (that is, it is 
defined and differentiable at each point (x[,x 2 ,x 2 ) in a certain region of space), 
the divergence of V, written V • V or div V, is defined by the scalar product 

( d d d \ 

j—ii + —e 2 +—e 3 ) ■ (Vf, + V 2 e 2 + V 3 e 2 ) 

OX j OX 2 (XX 3 J 



dV 1 dV 2 dV 3 
dx\ dx 2 + dx 2 



(1.47) 



The result is a scalar field. Note the analogy with A • B = A\B X + A 2 B 2 + A 2 B 3 , 
but also note that V • V f V • V (bear in mind that V is an operator). V • V is a 
scalar differential operator: 



V- V = 



Vi 



dx x 



' dx 2 



V , 



dx-i ' 



What is the physical significance of the divergence? Or why do we call the scalar 
product V • V the divergence of V? To answer these questions, we consider, as an 
example, the steady motion of a fluid of density p{x \ , x 2 , x 3 ), and the velocity field 
is given by v(x 1; x 2 ,x 3 ) = v l (x 1 ,x 2 ,x 3 )e 1 + v 2 (x 1 ,x 2 ,x 3 )e 2 + v 3 (x 1 ,x 2 ,x 3 )e 3 . We 
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now concentrate on the flow passing through a small parallelepiped ABCDEFGH 
of dimensions dx ] dx 2 dx 3 (Fig. 1.14). The x, and x 3 components of the velocity v 
contribute nothing to the flow through the face ABCD. The mass of fluid entering 
ABCD per unit time is given by pv 2 dxplx 2 and the amount leaving the face EFGH 
per unit time is 



pv 2 + 



djpvi) 

dx 2 



dx 2 



dx\dx 3 . 



So the loss of mass per unit time is [d(pv 2 ) / dx 2 \dx\dx 2 dx 2 . Adding the net rate of 
flow out all three pairs of surfaces of our parallelepiped, the total mass loss per 
unit time is 



' d 

dx\ 



(pv 1 ) + 



d 

dx 2 



(pv 2 ) + 



£CI < ' Wj) 



dx] dx 2 dx 2 = V • (p\)dx]dx 2 dx 2 . 



So the mass loss per unit time per unit volume is V • (pv). Hence the name 
divergence. 

The divergence of any vector V is defined as V • V. We now calculate V • (/V), 
where /is a scalar: 



v -(/ v ) = ^w) 



d 



(jy 2 ) 



=/(A 1 



\dxi 



dx 2 

dVi dVi 
dx 2 dx 2 



dx 



V, 



(jy 3 ) 

dxi 



Vi 



Of 

dx 2 



V , 



Of 

dx 2 



or 

V-(/V)=/V-V + V-V/. (1.48) 

It is easy to remember this result if we remember that V acts both as a differential 
operator and a vector. Thus, when operating on / V, we first keep / fixed and let V 




Figure 1.14. Steady flow of a fluid. 
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operate on V, and then we keep V fixed and let V operate on /( V • / is nonsense), 
and as V/ and V are vectors we complete their multiplication by taking their dot 
product. 

A vector V is said to be solenoidal if its divergence is zero: V • V = 0. 



The operator V 2 , the Laplacian 



The divergence of a vector field is defined by the scalar product of the operator V 
with the vector field. What is the scalar product of V with itself? 



V- = V • V = 



d_ 

d.X] 



-e\ 



_d_ 

dx~> 



e 2 



_d_ 

dx-> 



e 3 



d 

dx { 



e \ 



_d_ 

8Xn 



e 2 



_d_ 

dx? 



e 3 



dx\ 

This important quantity 



dx\ 



dx\ 



V 2 = 



d 1 

dx\ 



dx\ 



dx\ 



(1.49) 



is a scalar differential operator which is called the Laplacian, after a French 
mathematician of the eighteenth century named Laplace. Now, what is the diver- 
gence of a gradient? 

Since the Laplacian is a scalar differential operator, it does not change the 
vector character of the field on which it operates. Thus V 2 </>( r) is a scalar field 
if (j>( r) is a scalar field, and V 2 [V^(r)] is a vector field because the gradient V^(r) 
is a vector field. 

The equation V 2 <(> = 0 is called Laplace’s equation. 



The curl of a vector 



If V(.Vj , x 2 i xfj is a differentiable vector field, then the curl or rotation of V, 
written V x V (or curl V or rot V), is defined by the vector product 



curl V = V x V = 



<T 


e 2 


^3 


d 


d 


d 


dx { 


dx 2 


dx 3 


Vi 


V 2 


V 3 



+ , (dVi _dV l \ (dVi_dVi\ 

dx 2 dx 3 ) e2 \9x 3 dx\) e>3 VcLt] dx 2 ) 



ij,k 



dVk 

dxj ' 



(1.50) 
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The result is a vector field. In the expansion of the determinant the operators 
d/dxj must precede V t \ Ylijk stands for an( J £ ijk are the permutation 

symbols: an even permutation of ijk will not change the value of the resulting 
permutation symbol, but an odd permutation gives an opposite sign. That is, 

£ ijk £ jki £ kij £ jik £ kji £ ikji a a d 

£ij k = 0 if two or more indices are equal. 



A vector V is said to be irrotational if its curl is zero: V x V(r) = 0. From this 
definition we see that the gradient of any scalar field <j>( r) is irrotational. The proof 
is simple: 



V x (V0) 



<?i 


e 2 


O 


d 


d 


d 


dxi 


dx 2 


dx 3 


d 


d 


d 


dx\ 


dx 2 


dx 3 



0(xi,x 2 ,x 3 ) = 0 



(1.51) 



because there are two identical rows in the determinant. Or, in terms of the 
permutation symbols, we can write V x (V<(>) as 

d d 

V x (V0) = ^2,e ijk e i ——(j){x l ,X2,x- i ). 

ijk J k 

Now £jj k is antisymmetric in j, k , but d 2 /dxjdx k is symmetric, hence each term in 
the sum is always cancelled by another term: 

d d dd 

£ ijk r\ H f £jkj r\ r\ 1 : 

OXj UXfc VXk UXj 

and consequently V x (V0) = 0. Thus, for a conservative vector field F, we have 
curl F = curl (grad </>) = 0. 

We learned above that a vector V is solenoidal (or divergence-free) if its diver- 
gence is zero. From this we see that the curl of any vector field V(r) must be 
solenoidal: 

V - (V x V) = ^ A(V x V), = E A (p#!; = »• 0-52) 

because £ ijk is antisymmetric in i, j. 

If </>( r) is a scalar field and V(r) is a vector field, then 

V x (0V) = </>(V x V) + (V0) x V. (1.53) 
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We first write 



<h 


<?2 




0 


d 


d 


dx\ 


dx 2 


0x 3 


0Fi 


0F 2 


0F 3 


8V: 

•♦5E 


> <90 


-V 2 , 


dx\ 



V x (0V) = 



then notice that 

so we can expand the determinant in the above equation as a sum of two deter- 
minants: 



V x (0V) = 0 



<?i 


h 






<?i 


<?2 


h 


d 


d 


d 




00 


00 


00 


dx\ 


dx 2 


0X 3 


+ 


0X| 


0X 2 


0X 3 


y, 


v 2 


Vj 




Vi 


f 2 


V 3 



= 0(V x V) + (V0) x V. 



Alternatively, we can simplify the proof with the help of the permutation symbols 

^ijk • 

v x (0v) = 

ij,k ox i 

- <t>Y^ k£i dx. + \ Ei * ei dx. Vk 

l,J,K J IJ,K J 

= 0(V x V) + (V0) x V. 



A vector field that has non-vanishing curl is called a vortex field, and the curl of 
the field vector is a measure of the vorticity of the vector field. 

The physical significance of the curl of a vector is not quite as transparent as 
that of the divergence. The following example from fluid flow will help us to 
develop a better feeling. Fig. 1.15 shows that as the component v 2 of the velocity 
v of the fluid increases with x 3 , the fluid curls about the x r axis in a negative sense 
(rule of the right-hand screw), where dv 2 /dx 3 is considered positive. Similarly, a 
positive curling about the X] -axis would result from v 3 if dv 3 /dx 2 were positive. 
Therefore, the total X] component of the curl of v is 

[curl v] L = dv 3 /(dx 2 — dv 2 /dx 3 , 

which is the same as the Xj component of Eq. (1.50). 
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*3 




Figure 1.15. Curl of a fluid flow. 



Formulas involving V 

We now list some important formulas involving the vector differential operator V, 
some of which are recapitulation. In these formulas, A and B are differentiable 
vector field functions, and / and g are differentiable scalar field functions of 
position (x l ,x 2 ,x i ): 



(1) v(/g) =JVg + gVf ; 

(2) V • (/A) =/V • A + V/ • A; 

(3) Vx (/A)=/Vx A + V/xA; 

(4) V x (V/) = 0; 

(5) V • (V x A) = 0; 

(6) V ■ (A x B) = (V x A) ■ B - (V x B) x A; 

(7) V x (A x B) = (B • V)A - B(V • A) + A(V ■ B) - (A • V)B; 

(8) V x (V x A) = V(V • A) - V 2 A; 

(9) V(A ■ B) = A x (V x B) + B x (V x A) + (A • V)B + (B • V)A; 

(10) (A • V)r = A; 

(11) V ■ r = 3; 



(12) V x r = 0; 

(13) V • (U 3 r) = 0; 

d¥ 

(14) d F = ( dr • V)F + —dt (F a differentiable vector field quantity); 



(15) dip = dr ■ Vip + 




(ip a differentiable scalar field quantity). 



Orthogonal curvilinear coordinates 

Up to this point all calculations have been performed in rectangular Cartesian 
coordinates. Many calculations in physics can be greatly simplified by using, 
instead of the familiar rectangular Cartesian coordinate system, another kind of 
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system which takes advantage of the relations of symmetry involved in the parti- 
cular problem under consideration. For example, if we are dealing with sphere, we 
will find it expedient to describe the position of a point in sphere by the spherical 
coordinates (r,Q,<j)). Spherical coordinates are a special case of the orthogonal 
curvilinear coordinate system. Let us now proceed to discuss these more general 
coordinate systems in order to obtain expressions for the gradient, divergence, 
curl, and Laplacian. Let the new coordinates u\ , u 2 , u 3 be defined by specifying the 
Cartesian coordinates (x\ , x 2 , x 3 ) as functions of (u\ , u 2 . u 3 )\ 

x l = f( u U u 2i u i)t x 2 = g( u u u 2i u n)i x 3 = h{U \ , U 2 t W 3 ) , (1-54) 

where /, g, h are assumed to be continuous, differentiable. A point P (Fig. 1.16) in 
space can then be defined not only by the rectangular coordinates (xi,x 2 ,x 3 ) but 
also by curvilinear coordinates ( u\,u 2 ,u 3 ). 

If u 2 and u 3 are constant as U\ varies, P (or its position vector r) describes a curve 
which we call the u\ coordinate curve. Similarly, we can define the u 2 and u 3 coordi- 
nate curves through P. We adopt the convention that the new coordinate system is a 
right handed system, like the old one. In the new system dr takes the form: 

dr Dr dr 

dr = — — du\ + — — du 2 + 7 T — du 3 ■ 

OU\ VU2 

The vector dr/du\ is tangent to the u { coordinate curve at P. If u \ is a unit vector 
at P in this direction, then u { = dr/du\/\dr/du\\, so we can write dr/du\ = h\U\, 
where /i, = | dr/du\\. Similarly we can write dr/du 2 = h 2 u 2 and dr/du 3 = h 3 u , 
where h 2 = \dr/du 2 \ and h 3 = \dr/du 3 \, respectively. Then dr can be written 

dr = h\du\U\ + h 2 du 2 u 2 + h 3 du 3 u 3 . (1.55) 




Figure 1.16. Curvilinear coordinates. 
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The quantities h u h 2 ,h 3 are sometimes called scale factors. The unit vectors U \ , u 2 , 
w 3 are in the direction of increasing w, . u 2 , u 3 , respectively. 

If U] , u 2 , m 3 are mutually perpendicular at any point P , the curvilinear coordi- 
nates are called orthogonal. In such a case the element of arc length ds is given by 

ds = d r • dr = h^duj T h 2 du~> T h 3 du 3 . (1.56) 

Along a U[ curve, u 2 and u 3 are constants so that dr = h\du\U\. Then the 
differential of arc length ds ] along u { at P is h\ du [ . Similarly the differential arc 
lengths along u 2 and m 3 at P are ds 2 = h 2 du 2 , ds 3 = h 3 du 3 respectively. 

The volume of the parallelepiped is given by 

dV = \{h\du\U\) ■ ( h 2 du 2 u 2 ) x {h 3 du 3 u 3 )\ = hih 2 h 3 duidu 2 du 3 
since \u\ ■ u 2 x m 3 | = 1. Alternatively dV can be written as 



dV = 



Or 

du\ 



8 r 

du 2 



dr_ 

du 3 



du\du 2 du 2 = 



9(.\-i,x 2 ,x 3 ) 

d(u u u 2 ,u 3 ) 



du t du 2 da 3 . 



(1.57) 



where 





dx { 


dx 2 


dx x 




du 1 


du 2 


du 3 


d(x u x 2 ,x 3 ) _ 


dx 2 


dx 2 


dx 2 


d(Ui,U 2 ,U 3 ) 


du\ 


du 2 


du 3 




dx 3 


dx 3 


dx 3 




du 1 


du 2 


du 3 



is called the Jacobian of the transformation. 

We assume that the Jacobian / ^ 0 so that the transformation (1.54) is one to 
one in the neighborhood of a point. 

We are now ready to express the gradient, divergence, and curl in terms of 
U \ , u 2 , and w 3 . If 0 is a scalar function of u\ , u 2 , and w 3 , then the gradient takes the 
form 



_ 1 . 1 „ 1 <9</> „ 

S7(j) _ grad </> — — — — u\ + — — — u 2 + — — m 3 . 

n\ OU\ U2 OU 2 /?3 UUt, 

To derive this, let 

Vcj) — f\U\ +f 2 ii 2 +/ 3 m 3 , 

where / 1; / 2 ,/ 3 are to be determined. Since 

dr dr dr 

dr = — — du\ + — — du 2 + — — du 3 

OU\ OU 2 (/W 3 

— h\du\U\ -\- h2dii2U2 ~\~ hiduiUi, 



(1.58) 

(1.59) 
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we have 



d(f> = V(/> • dr = h\f\dii\ + h 2 f 2 du 2 + h 3 f 3 du 3 . 



, , _ d<t> , 30 , , dfi , 

Cl(p — — — CIU j ~\~ — — CIU2 ~\~ 77 — clu ^ , 

OU\ &U2 <71/3 

and on equating the two equations, we find 

f _ 1 d(t> 

Ji 70 5 * 3. 

hi oui 

Substituting these into Eq. (1.57), we obtain the result Eq. (1.58). 

From Eq. (1.58) we see that the operator V takes the form 

_ Ml 9 Ih d u 3 9 

V — 73 I- 7 - ^ f , o — • (1.60) 

h\ OU\ /?2 OU2 /?3 

Because we will need them later, we now proceed to prove the following two 
relations: 



(a) |Vm,| = hi \i = 1, 2, 3. 

(b) U\ = h 2 h 3 X7 u 2 x Vm 3 with similar equations for u 2 and u 3 . (1.61) 

Proof: (a) Let </> = Mi in Eq. (1.51), we then obtain Viq = U\/h\ and so 

|Vmj| = \u x \h[ x = /q \ since \u x \ = 1. 

Similarly by letting <j> = u 2 and m 3 , we obtain the relations for i = 2 and 3. 

(b) From (a) we have 

Vm! = U\/h\, Vm 2 = ih / h 2 1 and Vm 3 = u 3 /h 3 . 

Then 



_ _ M 2 X M 3 Ml , , , _ _ 

Vm 2 x Vm 3 = — — — = — — and Mi = « 2 « 3 Vm 2 x Vm 3 . 

« 2«3 « 2«3 

Similarly 

m 2 = /i 3 /iiV« 3 x Vmi and m 3 = h\h 2 Vii\ x Vm 2 . 

We are now ready to express the divergence in terms of curvilinear coordinates. 
If A = A 1 Mi + T 2 m 2 + /1 3 m 3 is a vector function of orthogonal curvilinear coordi- 
nates Mi, m 2 , and m 3 , the divergence will take the form 

V ■ A = div A = — — {h 2 h 3 A x ) + -^ — (h 3 h l A 2 ) + ^ — (h { h 2 A 3 ) . (1.62) 

To derive (1.62), we first write V • A as 

V • A = V • (AjMj) + V • (A 2 u 2 ) + V • (A 3 u 3 ), (1.63) 
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then, because u x = h x /i 2 Vii 2 x V« 3 , we express V • {A x ii x ) as 

V • {A x u x ) = V • (A { h 2 h^S7 u 2 x Vm 3 ) (i/j = h 2 h 3 \/u 2 x Vm 3 ) 



— V (A i h 2 h 3 ) • \7 u 2 x \7 u 3 A \ h 2 h 3 X7 • (V u 2 x V// 3 ) , 



where in the last step we have used the vector identity: V • (0 A) = 
(V</>) ■ A + </>(V x A). Now Vk ( - = Ui/h h i = 1, 2, 3, so V • (A x u x ) can be rewritten 
as 



V • {A x u x ) = V {A x h 2 h 3 ) ■ f- x ^ + 0 = V(A,/; 2 /z 3 ) • 

li 2 h 3 // 2 h 3 



The gradient \7(A l h 2 h 3 ) is given by Eq. (1.58), and we have 
V • (A x u x ) = 



l ~{A x h 2 h 3 )+f^{A x h 2 h 3 ) + f^-{A x h 2 h 3 ) 

tl\ Oli\ /?2 OUj /Z3 (ZW3 



1 



d 



h x h 2 h 3 du ] 



(A x h 2 h 3 ). 



U\ 

h 2 h 3 



Similarly, we have 



1 c) Id 

V • (A 2 u 2 ) = — — (A 2 h 3 h 1 ), and V • (A 3 u 3 ) = — — - — — (A 3 h 2 h x ). 

/? j /^2 ^3 // 1 /?2^3 C^3 

Substituting these into Eq. (1.63), we obtain the result, Eq. (1.62). 

In the same manner we can derive a formula for curl A. We first write it as 



V x A — V x (A x u x T A 2 u 2 T A 3 z/ 3 ) 
and then evaluate V x A 

Now it/ = hjVuj, i = 1, 2, 3, and we express V x {A x u x ) as 



V x (A x u x ) = V x (Aj/qVzq) 

= \/^A[h x 'j x X 7 u x T A x h\X 7 x X 7 u x 

= V(A x h x )x l -^ + 0 

«i 

u x d . t , . u 2 d ... . & 3 d ' 

T - n — (^1^1) + ~r~ n — (A 2 h 2 ) + —— — ( A 3 h 3 ) 

h\ OU\ 1I2 OU2 /Z3 UUt, 

u 2 d , J . u 3 d ... . 

■( A\h x ) ~ -rT--^—{A\h x ), 



U\ 

X lh 



h 3 Ji 1 dw 3 



/zi/; 2 du 2 
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with similar expressions for V x {A 2 u 2 ) and V x (A 3 u 3 ). Adding these together, 
we get V x A in orthogonal curvilinear coordinates: 



V x A = 



u i 

h 2 h 3 



ir 2 {A3h3) ~^ {A2h2 \ 



u 2 

h 3 h 



3«1 L 1 






+ 



U 3 



~Ly 2h2) ^~Ly x ' h \ 



h x h 2 u 

This can be written in determinant form: 





h { u 1 


^2^2 


h 3 u 3 


1 


d 


a 


d 


h\h 2 h 3 


du 1 


du 2 


du 3 




A\h\ 


A.2I12 


A 3 h 3 



_d_ 

du l 



{A 3 h 3 ) 



(1.64) 



(1.65) 



We now express the Laplacian in orthogonal curvilinear coordinates. From 
Eqs. (1.58) and (1.62) we have 

_ 1 dcj) „ l dcj) „ 1 dcj) „ 

V0 = grad cj) = — — «i + — » + - — u 3 , 
h\ ou\ h? 0U2 0U3 



V • A = div A = 



1 






h\h 2 h 3 

If A = V(/>, then A,- = (\ / h^dcj) / du h i = 1, 2, 3; and 



V • A = V • V0 = V- 



1 



h\h 2 h 3 



d ( h 2 h 3 dcf>\ d (h 3 h\ 



du\ \ h\ du\) du 2 \ h 2 du 2 ) du 3 \ h 3 du 3 



d (h\h 2 dcj) 



(1.66) 



Special orthogonal coordinate systems 

There are at least nine special orthogonal coordinates systems, the most common 
and useful ones are the cylindrical and spherical coordinates; we introduce these 
two coordinates in this section. 



Cylindrical coordinates (p, cj), z) 

U\ = p , u 2 = <j), u 3 = z; and u 3 = e p , u 2 = = e,. 

From Fig. 1.17 we see that 

X| = p cos (f>, x 2 = p sin (p, x 3 = z 
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*3 




Figure 1.17. Cylindrical coordinates. 



where 

p > 0, 0 < <j> < 2tt, — oo < z < oo. 

The square of the element of arc length is given by 

ds 2 = h\(dpf + h 2 (d(j>) 2 + h\(dz) 2 . 

To hnd the scale factors we notice that ds 1 = dr ■ dr where 

r = p cos <j>e i + p sin <f>e 2 + ze 2 . 

Thus 

ds 2 = dr ■ dr = (dp) 2 + p 2 (d(j)) 2 + (tfe) 2 . 
Equating the two ds 2 , we find the scale factors: 

hi = h p = 1 ,h 2 = h „ i = p, /? 3 = h- = 1. 



(1.67) 



From Eqs. (1.58), (1.62), (1.64), and (1.66) we hnd the gradient, divergence, curl, 
and Laplacian in cylindrical coordinates: 



9 $ 



19 $ 



9$ 



— ~E~ e p 4 r 7I e <h + ~FT e z, 



dp 

where $ = $(p, is a scalar function; 

1 



9r 



V • A = 



l P w +d w + i^ 



( 1 . 68 ) 



(1.69) 
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where 



and 



From Fig. 



Now 



but 



A — A p e p + + A z e z , 

(' [> 0 c z 

d d d 
dp d(f> dz ’ 

Ap pA(p A z 

2 _ i a ( d<s>\ i a 2 $ a 2 $ 

p dp \ dp) + p 1 d(j) 2 ^ dz 2 

Spherical coordinates (r. 0, <j>) 

«i =r,u 2 = 9 , m 3 = 0; u x = e r , u 2 = e e , w 3 = 

.18 we see that 

Xi = r sin 8 cos <j>, x 2 = r sin 9 sin <j>, x 3 = r cos 9. 
ds 2 = hi (dr) 2 + h\(d9) 2 + h\(dcj)f 



V x A = 



r = r sin 9 cos <f>e \ + r sin 9 sin (j)e 2 + r cos 9e 2 , 




Figure 1.18. Spherical coordinates. 



(1.70) 



(1.71) 
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so 



ds 1 = dr ■ dr = ( dr) 2 + r 2 (dd)' + r 2 sin 2 8(d<f>) 7 . 

Equating the two ds 2 , we find the scale factors: h\ = h r =1, h 2 = hg = r, 
/?3 = = rsin0. We then find, from Eqs. (1.58), (1.62), (1.64), and (1.66), the 

gradient, divergence, curl, and the Laplacian in spherical coordinates: 



- 9 $ 
V$ =e r — 
or 



, 19$ 

e v — oh ' 
r dO 



■ £</> 



1 9$ 

r sin 8 dtp ’ 



(1.72) 



V • A = 



1 

r 2 sin 8 



d d dA a 

sin 8 - (r 2 A r ) + r - (sin 8A e ) + r ; (1.73) 



V x A = 



1 



V 2 $ = 1 



r 2 sin 8 



dr 



r 2 sin 8 



2 9$ 
dr 



d_ 

dO 



re g 


r sin Oe $ 


d 


d 


dd 


dp 


rA r 


r sin 8 A ^ 


-( sin9 S) + 



1 9 2 $ 



(1.74) 



(1.75) 



Vector integration and integral theorems 

Having discussed vector differentiation, we now turn to a discussion of vector 
integration. After defining the concepts of line, surface, and volume integrals of 
vector fields, we then proceed to the important integral theorems of Gauss, 
Stokes, and Green. 

The integration of a vector, which is a function of a single scalar u, can proceed 
as ordinary scalar integration. Given a vector 

A(«) — A | (w)C[ A 2 (u')£2 T A 3 (w)t ?3 , 



then 



/ A{,,)d " = I M “ )d “ + / AMdl ‘ +i >J Mu)l ' u + B ' 

where B is a constant of integration, a constant vector. Now consider the integral 
of the scalar product of a vector A(.v 1 , x 2 ,x 3 ) and dr between the limit 
Pi(x u x 2 ,x 3 ) and P 2 (x u x 2 ,x 3 ): 



35 




VECTOR AND TENSOR ANALYSIS 





(Afi T A 2 e 2 T A-^e^ * (dx\C 1 T dx 2&2 T dx 3 ^ 3 ) 



Ai(x\,x 2 ,x 3 )dx\ 



A 2 {xi,x 2 ,x 3 )dx 2 



f-P 2 

+ / A 3 (x u x 2 ,x 3 )dx 3 . 

JP 1 



Each integral on the right hand side requires for its execution more than a knowl- 
edge of the limits. In fact, the three integrals on the right hand side are not 
completely defined because in the first integral, for example, we do not the 
know value of x 2 and x 3 in A 1 : 

r p 2 

h= A l (x u x 2 ,x 3 )dx l . (1.76) 

Jp 1 

What is needed is a statement such as 



xi=f{x l ),x 3 = g(x l ) (1.77) 

that specifies x 2 , x 3 for each value of x, . The integrand now reduces to 
Ai(xi,x 2 ,x 3 ) = Ai(x\,f(x\),g(x\)) = B\(x\) so that the integral 7) becomes 
well defined. But its value depends on the constraints in Eq. (1.77). The con- 
straints specify paths on the X\X 2 and x 3 X] planes connecting the starting point 
P x to the end point P 2 . The x, integration in (1.76) is carried out along these 
paths. It is a path-dependent integral and is called a line integral (or a path 
integral). It is very helpful to keep in mind that: when the number of integration 
variables is less than the number of variables in the integrand, the integral is not yet 
completely defined and it is path-dependent. However, if the scalar product A • dr is 
equal to an exact differential, A • dr = dp — Vp ■ dr, the integration depends only 
upon the limits and is therefore path-independent: 

rPi rPi 

/ A • dr = / dp = ip 2 — pi ■ 

Jp, Jp, 

A vector field A which has above (path-independent) property is termed conser- 
vative. It is clear that the line integral above is zero along any close path, and the 
curl of a conservative vector held is zero (V x A = V x (Vy>) = 0). A typical 
example of a conservative vector held in mechanics is a conservative force. 

The surface integral of a vector function A('x| . x 2 , X3) over the surface S is an 
important quantity; it is dehned to be 

/ A • d a, 

Js 
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A 




where the surface integral symbol f s stands for a double integral over a certain 
surface 5, and da is an element of area of the surface (Fig. 1 . 19), a vector quantity. 
We attribute to da a magnitude da and also a direction corresponding the normal, 
h, to the surface at the point in question, thus 

da = nda. 

The normal h to a surface may be taken to lie in either of two possible directions. 
But if da is part of a closed surface, the sign of n relative to da is so chosen that it 
points outward away from the interior. In rectangular coordinates we may write 

da = e x da x + e 2 da 2 + e 3 da 3 = e x dx 2 dx 3 + e 2 dx 3 dx ] + e 3 dx x dx 2 . 

If a surface integral is to be evaluated over a closed surface 5, the integral is 
written as 

ji A • da. 

Note that this is different from a closed-path line integral. When the path of 
integration is closed, the line integral is write it as 

A • ds, 

where T specifies the closed path, and ds is an element of length along the given 
path. By convention, ds is taken positive along the direction in which the path is 
traversed. Here we are only considering simple closed curves. A simple closed 
curve does not intersect itself anywhere. 



Gauss’ theorem (the divergence theorem) 

This theorem relates the surface integral of a given vector function and the volume 
integral of the divergence of that vector. It was introduced by Joseph Louis 
Lagrange and was first used in the modern sense by George Green. Gauss’ 
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name is associated with this theorem because of his extensive work on general 
problems of double and triple integrals. 

If a continuous, dilferentiable vector field A is defined in a simply connected 
region of volume V bounded by a closed surface S', then the theorem states that 



V 



A dV = 



j) A • da, 



(1.78) 



where dV = dx]dx 2 dx 3 . A simple connected region Fhas the property that every 
simple closed curve within it can be continuously shrunk to a point without 
leaving the region. To prove this, we first write 



V • MV = 




then integrate the right hand side with respect to x\ while keeping x 2 x 2 constant, 
thus summing up the contribution from a rod of cross section dx 2 dx 2 (Fig. 1.20). 
The rod intersects the surface S at the points P and Q and thus defines two 
elements of area da P and d a 0 : 



dA x 
■ dx\ 



dV = 



j f dx 2 dx 3 




dA | 
dx 1 



dx i = 



j) dx 2 dx 2 




dA i, 



where we have used the relation dA\ = (dAi/dx\)dx\ along the rod. The last 
integration on the right hand side can be performed at once and we have 

~^r dv = y i A i(Q) - A i {P)]dx 2 dx 3 , 



where Ai(Q) denotes the value of A ] evaluated at the coordinates of the point Q, 
and similarly for A\{P). 

The component of the surface element da which lies in the jq -direction is 
dci\ = dx 2 dx 2 at the point Q, and da\ = —dx 2 dx 3 at the point P. The minus sign 



*2 




Figure 1.20. A square tube of cross section dx 2 dx 3 . 



38 




VECTOR INTEGRATION AND INTEGRAL THEOREMS 



arises since the x { component of d a at P is in the direction of negative x { . We can 
now rewrite the above integral as 

[ — -dV = f A\(Q)dci\ + ( A\(P)dci \ , 

Jv ox i J Sg J Sp 

where Sq denotes that portion of the surface for which the X\ component of the 
outward normal to the surface element dci\ is in the positive xq -direction, and S P 
denotes that portion of the surface for which da\ is in the negative direction. The 
two surface integrals then combine to yield the surface integral over the entire 
surface S (if the surface is sufficiently concave, there may be several such as right 
hand and left hand portions of the surfaces): 

[ ^J-dV = (f A l da l . 

Jv OX\ J s 

Similarly we can evaluate the x 2 and x 3 components. Summing all these together, 
we have Gauss’ theorem: 

J v ^ ^2 Aidcij or V • A dV = j> A - da. 

We have proved Gauss’ theorem for a simply connected region (a volume 
bounded by a single surface), but we can extend the proof to a multiply connected 
region (a region bounded by several surfaces, such as a hollow ball). For inter- 
ested readers, we recommend the book Electromagnetic Fields , Roald K. 
Wangsness, John Wiley, New York, 1986. 



Continuity equation 

Consider a fluid of density p(r') which moves with velocity v(r) in a certain region. 
If there are no sources or sinks, the following continuity equation must be satis- 
fied: 

dp(r)/dt + V • j(r) = 0, 

where j is the current 

j(r) = p(r)v(r) 

and Eq. (1.79) is called the continuity equation for a conserved current. 

To derive this important equation, let us consider an arbitrary surface S enclos- 
ing a volume V of the fluid. At any time the mass of fluid within Vis M = f y pdV 
and the time rate of mass increase (due to mass flowing into V) is 




(1.79) 

(1.79a) 
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while the mass of fluid leaving V per unit time is 

/ p\ ■ hds = / V • (py)dV, 

Js Jv 

where Gauss’ theorem is used in changing the surface integral to volume integral. 
Since there is neither a source nor a sink, mass conservation requires an exact 
balance between these effects: 

LP V =-I* i ‘” )dr ’ “ /.( |+ v -w)rfK = °. 

Also since V is arbitrary, mass conservation requires that the continuity equation 

g + v.(„,) = gv.j = o 

must be satisfied everywhere in the region. 



Stokes’ theorem 

This theorem relates the line integral of a vector function and the surface integral 
of the curl of that vector. It was first discovered by Lord Kelvin in 1850 and 
rediscovered by George Gabriel Stokes four years later. 

If a continuous, differentiable vector field A is defined a three-dimensional 
region V, and S' is a regular open surface embedded in V bounded by a simple 
closed curve T, the theorem states that 

J V x A • da = ^ A • dl; (1.80) 

where the line integral is to be taken completely around the curve T and dl is an 
element of line (Fig. 1.21). 




Figure 1.21. Relation between da and dl in defining curl. 
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The surface S, bounded by a simple closed curve, is an open surface; and the 
normal to an open surface can point in two opposite directions. We adopt the 
usual convention, namely the right hand rule: when the fingers of the right hand 
follow the direction of dl, the thumb points in the da direction, as shown in Fig. 

1 . 21 . 

Note that Eq. (1.80) does not specify the shape of the surface S other than that 
it be bounded by F; thus there are many possibilities in choosing the surface. But 
Stokes’ theorem enables us to reduce the evaluation of surface integrals which 
depend upon the shape of the surface to the calculation of a line integral which 
depends only on the values of A along the common perimeter. 

To prove the theorem, we first expand the left hand side of Eq. (1.80); with the 
aid of Eq. (1.50), it becomes 



VxA'rfa = 



f ( dA 1 A dAl A 

/ - — da 2 — da 3 

Js \OX 3 OX 2 



f ( dA 3 dA 3 

/ ~7h rdai 

IS \OX2 OX i 



f (dA 2 dA 2 

/ o — da 3 — — da i 

Is \OXi OX 3 



(1.81) 



where we have grouped the terms by components of A. We next subdivide the 
surface S into a large number of small strips, and integrate the first integral on the 
right hand side of Eq. (1.81), denoted by /j , over one such a strip of width dx x , 
which is parallel to the x 2 x 3 plane and a distance x 3 from it, as shown in Fig. 1.21. 
Then, by integrating over x, , we sum up the contributions from all of the strips. 

Fig. 1.21 also shows the projections of the strip on the x 3 x 3 and X\X 2 planes that 
will help us to visualize the orientation of the surface. The element area da is 
shown at an intermediate stage of the integration, when the direction angles have 
values such that a and 7 are less than 90° and (3 is greater than 90°. Thus, 
da 2 = —dx\dx 3 and da 3 = dx\ dx 2 and we can write 



h = 




8A { 

Ox-, 



dx 9 



dA t 

dx 3 



dx- 



(1.82) 



Note that dx 2 and dx 3 in the parentheses are not independent because x 2 and x 3 
are related by the equation for the surface S and the value of X| involved. Since 
the second integral in Eq. (1.82) is being evaluated on the strip from P to Q for 
which x 1 = const., dx { = 0 and we can add (8A l /dx l )dx l = 0 to the integrand to 
make it dA\. 



d A 1 , dA 1 dA\ _ 

— — dx 1 H — — — dx 2 ~\ — — dx 3 — d A \ . 
Ox j Ox 2 ox 3 



And Eq. (1.82) becomes 

h = ~ [ dx i f dAi = f \A X {P) - Ai{Q)\dx x . 

J strips J P ./strips 
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Next we consider the line integral of A around the lines bounding each of the 
small strips. If we trace each one of these lines in the same sense as we trace the 
path T, then we will trace all of the interior lines twice (once in each direction) and 
all of the contributions to the line integral from the interior lines will cancel, 
leaving only the result from the boundary line 1. Thus, the sum of all of the 
line integrals around the small strips will equal the line integral T of A\. 

I ~ = £ A\dl\. (1.83) 

Similarly, the last two integrals of Eq. (1.81 ) can be shown to have the respective 
values 

j) A 2 dl 2 and 

Substituting these results and Eq. (1.83) 
theorem: 

I x A • da =; (p (yf[d/| Y A 2 dl 2 H~ A 2 dl 2 ^j ;= (p A • c / 1 . 

Js Jr Jr 

Stokes’ theorem in Eq. (1.80) is valid whether or not the closed curve V lies in a 
plane, because in general the surface S is not a planar surface. Stokes’ theorem 
holds for any surface bounded by E. 

In fluid dynamics, the curl of the velocity field v(r) is called its vorticity (for 
example, the whirls that one creates in a cup of coffee on stirring it). If the velocity 
field is derivable from a potential 

v(r) = -V0(r) 

it must be irrotational (see Eq. (1.51)). For this reason, an irrotational flow is also 
called a potential flow, which describes a steady flow of the fluid, free of vortices 
and eddies. 

One of Maxwell’s equations of electromagnetism (Ampere’s law) states that 

V x B = /J„j, 

where B is the magnetic induction, j is the current density (per unit area), and /i 0 is 
the permeability of free space. From this equation, current densities may be 
visualized as vortices of B. Applying Stokes’ theorem, we can rewrite Ampere’s 
law as 

j) B ' dr = Yo j ] ■ da = /u 0 /; 

it states that the circulation of the magnetic induction is proportional to the total 
current I passing through the surface S enclosed by T. 



(j) A 2 dl 2 . 

into Eq. (1.81) we obtain Stokes’ 
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Green’s theorem 

Green’s theorem is an important corollary of the divergence theorem, and it 
has many applications in many branches of physics. Recall that the divergence 
theorem Eq. (1.78) states that 

/ V • A dV = </> A • c/a. 



Let A = ipB, where ip is a scalar function and B a vector function, then V • A 
becomes 

V • A = V • {ip B) = Ip'S/ ■ B + B • VV’. 

Substituting these into the divergence theorem, we have 



ipB ■ c/a = / (V>V ■ B + B • V ip)dV. 



(1.84) 



If B represents an irrotational vector field, we can express it as a gradient of a 
scalar function, say, 

B = X7tp. 

Then Eq. (1.84) becomes 

<p ipB ■ c/a = f [ipV • (Vp) + (Vip) ■ (Vip)\dV. (1.85) 

Js ./[' 



Now 



B c/a = (Vtp) ■ hda. 



The quantity (V<£>) • h represents the rate of change of <j> in the direction of the 
outward normal; it is called the normal derivative and is written as 

(V<p) ■ h = dtp/dn. 

Substituting this and the identity V • (Vy>) = V 2 <p into Eq. (1.85), we have 

(£ip^f-da = f [ipV 2 ip + V<£> • Vip\dV. (1.86) 

Js dn J v 

Eq. (1.86) is known as Green’s theorem in the first form. 

Now let us interchange ip and ip, then Eq. (1.86) becomes 

(fp^da= f [ip\7 2 ip +V(p ■ \7tp\dV. 

JS V n Jv 

Subtracting this from Eq. (1.85): 

f s ~ ^^) dCl = jy ~ ^ 2 ^) dV - ( L8? ) 
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This important result is known as the second form of Green’s theorem, and has 
many applications. 



Green’s theorem in the plane 

Consider the two-dimensional vector held A = M(x\,x 2 )e t + N(x i,x 2 )e 2 . From 
Stokes’ theorem 



A - dr = / V x A ■ ha = 



(188) 



which is often called Green’s theorem in the plane. 

Since j> r A-dr = § v (Mdx\ + Ndx 2 ), Green’s theorem in the plane can be writ- 
ten as 



Mdx\ + Ndx 2 = 



I" (ON dM\ 

Is Wi 9x 2 ) 



dx\dx 2 . 



(1.88a) 



As an illustrative example, let us apply Green’s theorem in the plane to show 
that the area bounded by a simple closed curve T is given by 

- (f x\dx 2 — x 2 dx\. 



Into Green’s theorem in the plane, let us put M = — x 2 , N = x, , giving 

l x ' dx 2 ' X2d " = l d Xi - h^ dx ' dx '- = 2 1 dx ' dx > = 

where A is the required area. Thus A = \ f r x t dx 2 — x 2 dx\. 



Helmholtz’s theorem 

The divergence and curl of a vector held play very important roles in physics. We 
learned in previous sections that a divergence-free held is solenoidal and a curl- 
free held is irrotational. We may classify vector helds in accordance with their 
being solenoidal and/or irrotational. A vector held V is: 

(1) Solenoidal and irrotational if V • V = 0 and V x V = 0. A static electric 
held in a charge-free region is a good example. 

(2) Solenoidal if V • V = 0 but V x V / 0. A steady magnetic held in a current- 
carrying conductor meets these conditions. 

(3) Irrotational if V x V = 0 but V • V = 0. A static electric held in a charged 
region is an irrotational held. 

The most general vector held, such as an electric held in a charged medium with 
a time-varying magnetic held, is neither solenoidal nor irrotational, but can be 
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considered as the sum of a solenoidal field and an irrotational field. This is made 
clear by Helmholtz’s theorem, which can be stated as (C. W. Wong: Introduction 
to Mathematical Physics , Oxford University Press, Oxford 1991; p. 53): 

A vector field is uniquely determined by its divergence and curl in 
a region of space, and its normal component over the boundary 
of the region. In particular, if both divergence and curl are 
specified everywhere and if they both disappear at infinity 
sufficiently rapidly, then the vector field can be written as a 
unique sum of an irrotational part and a solenoidal part. 

In other words, we may write 

V(r) = -Vc/>(r) + V x A(r), (1.89) 

where — V0 is the irrotational part and V x A is the solenoidal part, and <f>( r) and 
A(r) are called the scalar and the vector potential, respectively, of V(r). If both A 
and 4> can be determined, the theorem is verified. How, then, can we determine A 
and <\P. If the vector field V(r) is such that 

V • V(r) = p, and V x V(r) = v, 

then we have 

V • V(r) = p = -V • (V0) + V • (V x A) 
or 

V 2 </> - -p, 

which is known as Poisson’s equation. Next, we have 

V x V(r) = v = V x [-V0 + V x A(r)] 
or 

V 2 A = v; 

or in component, we have 

V 2 A,- = Vi,i =1,2,3 

where these are also Poisson’s equations. Thus, both A and cj) can be determined 
by solving Poisson’s equations. 



Some useful integral relations 

These relations are closely related to the general integral theorems that we have 
proved in preceding sections. 

(1) The line integral along a curve C between two points a and h is given by 

b 

(y4>) ■ dl = <t>{b) — <j>{a). (1.90) 
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Proof: 



(' V<j>) • d\ = J ^ j + k ) • (dxi + dyj + dzk) 



rb ''dcp , dcp dcp 

-ax + —dy + —dz 



( 2 ) 



dx dy ' dz 



r fdcp dx dcp dy dcp dz\ 

J a \cbc dt ^ dy dt dz dt) Ct 

=1 (tH =«'’)-«»)■ 

l d ^da= [ V 2 p>dV. 

Is on J v 



(1.91) 



Proof: Set ip = 1 in Eq. (1.87), then dip/dn = 0 = V 2 ip and Eq. (1.87) reduces to 
Eq. (1.91). 



( 3 ) 



/ V<pdV = ® ipnda. 
' v Js 



(1.92) 



Proof: In Gauss’ theorem (1.78), let A = ipC, where C is constant vector. Then 
we have 



Since 



we have 



[ V • (<pC)dV = f <pC ■ hda. 
Jv Js 



V • (<pC) = V cp C = C Vp and tpC ■ n = C ■ (<pn), 



f C ■ VipdV = f C • ( ipn)da . 
Jv Js 

Taking C outside the integrals, 

C • f S7ipdV = C • f (ipn)da 
Jv Js 

and since C is an arbitrary constant vector, we have 

J \7pdV = j> ipnda. 



( 4 ) 



/ V x B dV = / n x B da 
' v Js 



(1.93) 
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Proof: In Gauss’ theorem (1.78), let A = B x C where C is a constant vector. We 
then have 

[ V • (B x C )dV = [ (B x €) • hda. 

Jv Js 

Since V • (B x C) = C • (V x B) and (B x C) • n = B • (C x h) = (C x n) • B = 
C • (n x B), 

/ C • (V x B )dV = [ C • (n x B )da. 

Jv Js 

Taking C outside the integrals 

C • l (V x B )dV = C [ (fix B )da 
Jv Js 

and since C is an arbitrary constant vector, we have 

[ V x B dV = f h x Bda. 



Tensor analysis 

Tensors are a natural generalization of vectors. The beginnings of tensor analysis 
can be traced back more than a century to Gauss’ works on curved surfaces. 
Today tensor analysis finds applications in theoretical physics (for example, gen- 
eral theory of relativity, mechanics, and electromagnetic theory) and to certain 
areas of engineering (for example, aerodynamics and fluid mechanics). The gen- 
eral theory of relativity uses tensor calculus of curved space-time, and engineers 
mainly use tensor calculus of Euclidean space. Only general tensors are 
considered in this section. The general definition of a tensor is given, followed 
by a concise discussion of tensor algebra and tensor calculus (covariant differen- 
tiation). 

Tensors are defined by means of their properties of transformation under 
coordinate transformation. Let us consider the transformation from one coordi- 
nate system (x 1 ,* 2 , . . . ,x N ) to another {x n ,x' 2 , . . . ,x' N ) in an TV-dimensional 
space V N . Note that in writing x'f the index n is a superscript and should not 
be mistaken for an exponent. In three-dimensional space we use subscripts. We 
now use superscripts in order that we may maintain a ‘balancing’ of the indices in 
all the general equations. The meaning of ‘balancing’ will become clear a little 
later. When we transform the coordinates, their differentials transform according 
to the relation 

<«■ (1.94) 
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Here we have used Einstein’s summation convention: repeated indexes which 
appear once in the lower and once in the upper position are automatically 
summed over. Thus, 

\i=\ 

It is important to remember that indexes repeated in the lower part or upper part 
alone are not summed over. An index which is repeated and over which summa- 
tion is implied is called a dummy index. Clearly, a dummy index can be replaced 
by any other index that does not appear in the same term. 



Contravariant and covariant vectors 



A set of N quantities A'\yL = 1,2 which, under a coordinate change, 
transform like the coordinate differentials, are called the components of a contra- 
variant vector or a contravariant tensor of the first rank or first order: 



A ^0^ A ' V - ^ 

This relation can easily be inverted to express A ,v in terms of A' 1 . We shall leave 
this as homework for the reader (Problem 1.32). 

If N quantities A l \^j,= 1 , 2, . . . , N) in a coordinate system (x 1 , x 2 , . . . , x N ) are 
related to N other quantities A' v (v = 1,2 in another coordinate system 
(x n ,x' 2 , ,x' N ) by the transformation equations 



_dx v 

Afl ~dxf r 



(1.96) 



they are called components of a covariant vector or covariant tensor of the first 
rank or first order. 

One can show easily that velocity and acceleration are contravariant vectors 
and that the gradient of a scalar field is a covariant vector (Problem 1.33). 

Instead of speaking of a tensor whose components are A 11 or A „ we shall simply 
refer to the tensor A M or A „. 



Tensors of second rank 

From two contravariant vectors A 1 ' and B" we may form the N 2 quantities A 1 ' B" . 
This is known as the outer product of tensors. These N 2 quantities form the 
components of a contravariant tensor of the second rank: any aggregate of N 2 
quantities T^ v which, under a coordinate change, transform like the product of 
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two contravariant vectors 






Ox' 1 dx v 
dx"' dx’P 



T’ 



a(h 



( 1 . 97 ) 



is a contravariant tensor of rank two. We may also form a covariant tensor of 
rank two from two covariant vectors, which transforms according to the formula 



dx a dx {i , 
dx» dx v a/3 ‘ 



( 1 . 98 ) 



Similarly, we can form a mixed tensor T ,L V of order two that transforms as 
follows: 



T* 



dx jJ dx 7? , a 
dx' a dx" li 



( 1 . 99 ) 



We may continue this process and multiply more than two vectors together, 
taking care that their indexes are all different. In this way we can construct tensors 
of higher rank. The total number of free indexes of a tensor is its rank (or order). 

In a Cartesian coordinate system, the distinction between the contravariant and 
the covariant tensors vanishes. This can be illustrated with the velocity and 
gradient vectors. Velocity and acceleration are contravariant vectors, they are 
represented in terms of components in the directions of coordinate increase; the 
gradient vector is a covariant vector and it is represented in terms of components 
in the directions orthogonal to the constant coordinate surfaces. In a Cartesian 
coordinate system, the coordinate direction x 1 ' coincides with the direction ortho- 
gonal to the constant - jC 1 surface, hence the distinction between the covariant and 
the contravariant vectors vanishes. In fact, this is the essential difference between 
contravariant and covariant tensors: a covariant tensor is represented by com- 
ponents in directions orthogonal to like constant coordinate surface, and a 
contravariant tensor is represented by components in the directions of coordinate 
increase. 

If two tensors have the same contravariant rank and the same covariant rank, 
we say that they are of the same type. 



Basic operations with tensors 

(1) Equality: Two tensors are said to be equal if and only if they have the same 
covariant rank and the same contravariant rank, and every component of 
one is equal to the corresponding component of the other: 

A a \ = 
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(2) Addition (subtraction): The sum (difference) of two or more tensors of the 
same type and rank is also a tensor of the same type and rank. Addition of 
tensors is commutative and associative. 

(3) Outer product of tensors: The product of two tensors is a tensor whose rank 
is the sum of the ranks of the given two tensors. This product involves 
ordinary multiplication of the components of the tensor and it is called 
the outer product. For example, A „ va B\ = C /iA 1 ' Q/3 is the outer product 
of A p m: and B /3 a . 

(4) Contraction: If a covariant and a contravariant index of a mixed tensor are 

set equal, a summation over the equal indices is to be taken according to the 
summation convention. The resulting tensor is a tensor of rank two less than 
that of the original tensor. This process is called contraction. For example, if 
we start with a fourth-order tensor one way of contracting it is to set 

6 = p, which gives the second rank tensor T'\ p p . We could contract it again 
to get the scalar T p pp p . 

(5) Inner product of tensors: The inner product of two tensors is produced by 
contracting the outer product of the tensors. For example, given two tensors 
A a ^ s and B'\„ the outer product is A^gB^,,. Setting 8 = n, we obtain the 
inner product d a/3 /J fi / ', / . 

(6) Symmetric and antisymmetric tensors: A tensor is called symmetric with 
respect to two contravariant or two covariant indices if its components 
remain unchanged upon interchange of the indices: 

A a 0 = A pa , Aa p = A 0 a- 

A tensor is called anti-symmetric with respect to two contravariant or two 
covariant indices if its components change sign upon interchange of the 
indices: 

^ } -™-a/3 **-(3or 

Symmetry and anti-symmetry can be defined only for similar indices, not 
when one index is up and the other is down. 



Quotient law 

A quantity Q a „ with various up and down indexes may or may not be a tensor. 
We can test whether it is a tensor or not by using the quotient law, which can be 
stated as follows: 

Suppose it is not known whether a quantity A is a tensor or not. 

If an inner product of X with an arbitrary tensor is a tensor, then 
X is also a tensor. 
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As an example, let X = P A , A x be an arbitrary contravariant vector, and A x P X)lv 
be a tensor, say Q A x P Xflv = Q^, then 



A X P XliV = 



dx' a dx' 0 
dx 1 ' dx v 



A' 1 P' 



■yap' 



But 



and so 



A' 1 = 



d 4rA> 

dx x 



x dx' a dx 10 dx' 1 , x , 

x dxP dx v dx x ia0 ' 

This equation must hold for all values of A x , hence we have, after canceling the 
arbitrary A x , 

dx' a dx' 0 dx' 1 , 

Px » v ~ dx? dx v dx x P ia0, 

which shows that P x is a tensor (contravariant tensor of rank 3). 



The line element and metric tensor 

So far covariant and contravariant tensors have nothing to do each other except 
that their product is an invariant: 

A' b'i‘ = ® x ; | a 0 = — — 4 A 0 = 6° -,A A 0 = A A a 
^ dx'» dx 0 “ dx 0 a 0 a a 

A space in which covariant and contravariant tensors exist separately is called 
affine. Physical quantities are independent of the particular choice of the mode 
of description (that is, independent of the possible choice of contravariance or 
covariance). Such a space is called a metric space. In a metric space, contravariant 
and covariant tensors can be converted into each other with the help of the metric 
tensor g That is, in metric spaces there exists the concept of a tensor that may be 
described by covariant indices, or by contravariant indices. These two descrip- 
tions are now equivalent. 

To introduce the metric tensor g /a/ , let us consider the line element in V N . In 
rectangular coordinates the line element (the differential of arc length) ds is given by 

ds 2 = dx 2 + dy 1 + dz 2 = (d.v 1 ) 2 + (dx 2 ) 2 + (dx 3 ) 2 -, 

there are no cross terms dx dx J . In curvilinear coordinates ds 2 cannot be repre- 
sented as a sum of squares of the coordinate differentials. As an example, in 
spherical coordinates we have 

ds 2 = dr 2 + r 2 d0 2 + r 2 sin 2 6d(jr 

which can be in a quadratic form, with x l = r,x 2 = 9, x 3 = (j). 
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A generalization to V N is immediate. We define the line element ds in V N to be 
given by the following quadratic form, called the metric form, or metric 

3 3 

ds 2 = ^ ^ g llv dx IJ dx l/ = g^dx^dx". ( 1 .100) 

[1=1 V= 1 

For the special cases of rectangular coordinates and spherical coordinates, we 
have 

/ 1 0 0\ /I 0 0 \ 

g = (gi.w) = I 0 1 0 , g = (g lw ) = I 0 r 2 0 (1.101) 

\0 0 1 ) \0 0 r 2 sin 2 9 J 

In an A-dimensional orthogonal coordinate system g lu , = 0 for gL ^ v. And in a 
Cartesian coordinate system g = 1 and g /w = 0 for g ^ v. In the general case of 
Riemannian space, the g )W are functions of the coordinates x^(g = 1,2, . . . , N). 

Since the inner product of g jJV and the contravariant tensor dx^dx” is a scalar 
(ds 2 , the square of line element), then according to the quotient law g. M , is a 
covariant tensor. This can be demonstrated directly: 

ds 2 = g a pdx a dx P = g' a pdx' a dx'P . 

Now dx' a = (dx !a / dx IJ )dx IJ , so that 

8x la fix' 13 

S'afi Qxp Q x u dx,l dx V = g, v dx IJ dx V 



or 

/ dx ,a dx 10 \ 

(®'" s av 7 _ S *"J dx “ dx " = °' 

The above equation is identically zero for arbitrary dx' 1 , so we have 

Bx' a dx ' 13 

( U02) 

which shows that g )W is a covariant tensor of rank two. It is called the metric 
tensor or the fundamental tensor. 

Now contravariant and covariant tensors can be converted into each other with 
the help of the metric tensor. For example, we can get the covariant vector (tensor 
of rank one) A fl from the contravariant vector A 1 ': 

A„=g^A v . ( 1 . 103 ) 

Since we expect that the determinant of g jiv does not vanish, the above equations 
can be solved for A v in terms of the A... Let the result be 

A 6 

A u = g^A^. ( 1 . 104 ) 
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By combining Eqs. (1.103) and (1.104) we get 

A = s s va A 

Since the equation must hold for any arbitrary A „, we have 

g,y a = s, a , (l.ios) 

where 6^° is Kronecker’s delta symbol. Thus, g pv is the inverse of g /w and vice 
versa; g is often called the conjugate or reciprocal tensor of g /w . But remember 
that g pv and g are the contravariant and covariant components of the same 
tensor, that is the metric tensor. Notice that the matrix ( g M ") is just the inverse of 
the matrix (g IM/ ). 

We can use g /a , to lower any upper index occurring in a tensor, and use g ,w to 
raise any lower index. It is necessary to remember the position from which the 
index was lowered or raised, because when we bring the index back to its original 
site, we do not want to interchange the order of indexes, in general T IJV ^ T V,J ' . 
Thus, for example 

A% = g rp A rq , A” = g rp g? g A rs , A p rs = g rq A pq s . 



Associated tensors 

All tensors obtained from a given tensor by forming an inner product with the 
metric tensor are called associated tensors of the given tensor. For example, A a 
and A a are associated tensors: 

A a = g a p A0 i Aa = g a ' iA p- 



Geodesics in a Riemannian space 

In a Euclidean space, the shortest path between two points is a straight line 
joining the two points. In a Riemannian space, the shortest path between two 
points, called the geodesic, may be a curved path. To find the geodesic, let us 
consider a space curve in a Riemannian space given by x p = f p {t) and compute 
the distance between two points of the curve, which is given by the formula 



s = 



\J g\ li dx x dx IJ ’ = £ yjgx^dx^x^dt, 



(1.106) 



where dx x = dx x /dt, and t (a parameter) varies from point to point of the geo- 
desic curve described by the relations which we are seeking. A geodesic joining 
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two points P and Q has a stationary value compared with any other neighboring 
path that connects P and Q. Thus, to find the geodesic we extremalize (1.106), and 
this leads to the differential equation of the geodesic (Problem 1.37) 



d ( dF\ dF 

dt \cbc J dx 



(1.107) 



where F = \Jg a pxPxP, and x = dx/dt. Now 



dF _ 1 

H~2 



( ga0 X a X^ 



■PY 1 ! 2 d g a 0 - a .0 dF 1/ , a .A— V 2 , Q 

'' ' L - [gap* X ) 2 g aJ X 



dxt~ X X ' dx- ' 2 



and 



dsjdt = \jg a px a xP . 

Substituting these into (1.107) we obtain 



d 

It [kar 



(g^x a s 1 ) - 



1 dg, 

2 dx 1 



^x a x p s~ x = 0, s = 



ds 

dt 



or 



gayX 



dg, 



a 7 



X X" — - - 



1 dg c 



l-l-a-0 



•a •• •— 1 



dx 13 ' ' 2 dx 7 

We can simplify this equation by writing 



v .r = g ai x ss 



dgaj _^a j c P=-( i 



dx 13 



2 \ dx 13 dx a J 



X"X P, 



then we have 

g ai x a + [a/3, i\x a ^ = g a Y a ss~ l - 



We can further simplify this equation by taking arc length as the parameter t, then 
i = 1 , s = 0 and we have 



where the functions 



d 2 x a 

8ai lY 



+ [a/3, 7 ] 



dx a dx 13 
ds ds 



[a/3, 7 ] 1 3 , 7-7 



1 fdga^l dg pry dg a p\ 

2 \ dx 13 dx a dx 7 J 



(1.108) 



(1.109) 



are called the Christoffel symbols of the first kind. 
Multiplying (1.108) by g PJ , we obtain 



d 2 x p j P 1 dx° dx 13 
ds 2 1 a/3 J ds ds 



( 1 . 110 ) 
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where the functions 



P 

af3 



= T p a f i = g pl WM\ 



( 1 . 111 ) 



are the Christoffel symbol of the second kind. 

Eq. (1.110) is, of course, a set of N coupled differential equations; they are the 
equations of the geodesic. In Euclidean spaces, geodesics are straight lines. In a 
Euclidean space, g a p are independent of the coordinates x 1 ' , so that the Christoffel 
symbols identically vanish, and Eq. (1.110) reduces to 



with the solution 



x p = a p s+ b p , 



where a p and b p are constants independent of s. This solution is clearly a straight 
line. 

The Christoffel symbols are not tensors. Using the defining Eqs. (1.109) and the 
transformation of the metric tensor, we can find the transformation laws of the 
Christoffel symbol. We now give the result, without the mathematical details: 



- dx a dx 0 dx< dx a d 2 x 0 

m A a/3,7 dx^ dx v dx x ^ a(3 dx x dx il dx v ’ 



( 1 . 112 ) 



The Christoffel symbols are not tensors because of the presence of the second term 
on the right hand side. 



Covariant differentiation 

We have seen that a covariant vector is transformed according to the formula 

f) x v 

A = —— A 
M dxP 

where the coefficients are functions of the coordinates, and so vectors at different 
points transform differently. Because of this fact, dA p is not a vector, since it is the 
difference of vectors located at two (infinitesimally separated) points. We can 
verify this directly: 



dA p dA v dx v dx i3 d 2 x' J 

dx 1 dx^ dx 1 ' dx y v dx^dx 1 ’ 



(1.113) 
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which shows that dA/dx ^ are not the components of a tensor because of the 
second term on the right hand side. The same also applies to the differential of 
a contravariant vector. But we can construct a tensor by the following device. 
From Eq. (1.111) we have 



, dx a dx T dx a < 9 V dx° 
aT dx p dx 1 dx p + dx IJ dx< dxZ ' 



Multiplying (1.114) by A a and subtracting from (1.113), we obtain 



dA M 
dx 1 



-a„ r Q 




-A p Y» 




dx° dx? 
dx 1 ' dx 1 ' 



(1.114) 



(1.115) 



If we dehne 



then (1.115) can be rewritten as 

- dx a dx 13 

A ^ = Aa ^d¥dF’ 



which shows that A a .p is a covariant tensor of rank 2. This tensor is called the 
covariant derivative of A a with respect to x r> . The semicolon denotes covariant 
differentiation. In a Cartesian coordinate system, the Christoffel symbols vanish, 
and so covariant differentiation reduces to ordinary differentiation. 

The contravariant derivative is found by raising the index which denotes differ- 
entiation: 



A* a 



g aa A p 



;cr 



(1.117) 



We can similarly determine the covariant derivative of a tensor of arbitrary 
rank. In doing so we find the following simple rule helps greatly: 

To obtain the covariant derivative of the tensor 77.7 with respect to 
x p , we add to the ordinary derivative dTZ /dxf for each covariant 
index v{TZ v ) a term — T a pi ,TZ a ., and for each contravariant index 
v(TZ v "') a term +T“ vp TZ a '" . 

Thus, 



dT. w a a 

r ~r h lL/ pP r r pP r r 

1 nv,a — g x a 1 m 1 1 va 1 nfii 

pc _ Z ” _ T p 4- F^ 

A v.rv r\ /->, i>d J p I nrv u • 



dx' 



Pa 1 v 



The covariant derivatives of both the metric tensor and the Kronnecker delta 
are identically zero (Problem 1.38). 
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Problems 

1.1. Given the vector A= (2,2,— 1) and B= (6,— 3,2), determine: 

(a) 6A — 3B, (b) A 2 + B 2 , (c) A • B, (d) the angle between A and B, (<?) the 
direction cosines of A, (/) the component of B in the direction of A. 

1.2. Find a unit vector perpendicular to the plane of A = (2, — 6, — 3) and 
B = (4, 3,-1). 

1.3. Prove that: 

(a) the median to the base of an isosceles triangle is perpendicular to the 
base; ( b ) an angle inscribed in a semicircle is a right angle. 

1.4. Given two vectors A = (2, 1,-1), B = (1,— 1,2) find: (a) A x B, and (b) a 
unit vector perpendicular to the plane containing vectors A and B. 

1.5. Prove: (a) the law of sines for plane triangles, and (b) Eq. (1.16a). 

1.6. Evaluate (2e x - 3e 2 ) ■ [(g + e 2 - e 3 ) x (3e x - e 3 )]. 

1 .7. (a) Prove that a necessary and sufficient condition for the vectors A, B and 

C to be coplanar is that A • (B x C) = 0. 

( b ) Find an equation for the plane determined by the three points 
Pi(2, —1,1), P 2 (3, 2,-1) and P 3 (— 1, 3, 2). 

1.8. (a) Find the transformation matrix for a rotation of new coordinate system 

through an angle </> about the x 3 ( = r)-axis. 

( b ) Express the vector A = 31?! + 2e 2 + c 3 in terms of the triad e(e 2 e 3 where 
the x[x 2 axes are rotated 45° about the x 3 -axis (the x 3 - and x 3 -axes 
coinciding). 

1.9. Consider the linear transformation A- = ]C)Li e\ ■ ejAj = i Show, 
using the fact that the magnitude of the vector is the same in both systems, 
that 



3 

^ Z X V X ik = 6 Jk 1,2,3). 

i=l 

1.10. A curve C is defined by the parametric equation 

r (m) = X\(u)e i + x 2 {u)e 2 + x 3 («)e 3 , 

where u is the arc length of C measured from a fixed point on C, and r is the 
position vector of any point on C; show that: 

(a) dr /du is a unit vector tangent to C; 

( b ) the radius of curvature of the curve C is given by 
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y 




Figure 1.22. Motion on a circle. 



1.11. (a) Show that the acceleration a of a particle which travels along a space 
curve with velocity v is given by 



dv - vr « 

a = — r + —N, 
dt p 



where f, N, and p are as dehned in the text. 

(b) Consider a particle P moving on a circular path of radius r with constant 
angular speed u = d9/dt (Fig. 1.22). Show that the acceleration a of the 
particle is given by 



a = 



— to r. 



1.12. A particle moves along the curve X\ = 2 1 2 , x 2 = t 2 — 4 1 , x 3 = 3t — 5, where t 
is the time. Find the components of the particle’s velocity and acceleration 
at time t = 1 in the direction e\ — 3e 2 + 2e 3 . 

1.13. (a) Find a unit vector normal to the surface x\ + x\ — x 3 = 1 at the point 

P(FU). 

(, b ) Find the directional derivative of (j> = x]x 2 x 3 + 4x 3 x 3 at (1, —2, — 1) in 
the direction 2e\ — e 2 — 2e 3 . 

1.14. Consider the ellipse given by r, + r 2 = const. (Fig. 1.23). Show that r x and r 2 
make equal angles with the tangent to the ellipse. 

1.15. Find the angle between the surfaces x\ + x 2 + x 3 = 9 and x 3 = x\ + x 2 — 3. 
at the point (2, —1, 2). 

1.16. (a) If / and g are differentiable scalar functions, show that 

V(/g) =/Vg + gV/. 

( b ) Find Vr if r = + x\ + x 3 )^ 2 . 

(c) Show that Vr" = «r" 2 r. 

1.17. Show that: 

{a) V • (r/r 3 ) = 0. Thus the divergence of an inverse-square force is zero. 
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(b) If/ is a differentiable function and A is a differentiable vector function, 
then 

V • (/A) = (V/) • A +/(V • A). 

1.18. (a) What is the divergence of a gradient? 

( b ) Show that V 2 (l/r) = 0. 

(c) Show that r • (V • r) / (rV)r. 

1.19 Given V • E = 0, V • H = 0, V xE = —dH/dt,X/ x H = dE/dt, show that 
E and H satisfy the wave equation V 2 « = d 2 u/dt 2 . 

The given equations are related to the source-free Maxwell’s equations of 
electromagnetic theory, E and H are the electric field and magnetic field 
intensities. 

1 .20. (a) Find constants a, b, c such that 

A = (xi + 2x 2 + ax 3 )ci + (bxi — 3x 2 — x 3 )e 2 + (4xi + cx 2 + 2x 3 )e 3 



is irrotational. 

( b ) Show that A can be expressed as the gradient of a scalar function. 

1.21. Show that a cylindrical coordinate system is orthogonal. 

1 .22. Find the volume element d V in: (a) cylindrical and (b) spherical coordinates. 
Hint: The volume element in orthogonal curvilinear coordinates is 



dV = h\h 2 h 2 du\du 2 diiT, = 



d{x U X2,X 3 ) 

d(u u u 2 ,u 3 ) 



du\du 2 du 3 . 



1 .23. Evaluate the integral (x 2 — y)dx + (y 2 + x)dy along 

(a) a straight line from (0, 1) to (1, 2); 

( b ) the parabola x = t,y = t 2 + 1; 

(c) straight lines from (0, 1) to (1, 1) and then from (1, 1) to (1, 2). 

1.24. Evaluate the integral (x 2 +y 2 )dx along (see Fig. 1.24): 

(a) the straight line y = x, 

(b) the circle arc of radius 1 (x — l) 2 + v 2 = 1. 
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Figure 1.24. Paths for a path integral. 



1.25. Evaluate the surface integral f s A ■ da = f s A ■ hda, where A = x l x 2 e l — 
x 2 e 2 + (*i + x 2 )e 3 , S is that portion of the plane 2x, + 2x 2 + x 3 = 6 
included in the first octant. 

1.26. Verify Gauss’ theorem for A = (2xj — x 3 )e x + xfx 2 e 2 — XiX 2 e 3 taken over 
the region bounded by x, = 0,Xj = l,x 2 = 0, x 2 = l,x 3 = 0,x 3 = 1. 

1.28 Show that the electrostatic field intensity E( r) of a point charge Q at the 
origin has an inverse-square dependence on r. 

1.28. Show, by using Stokes’ theorem, that the gradient of a scalar field is irrota- 
tional: 

V x (V^(r)) = 0. 

Verify Stokes’ theorem for A = (2xi — x 2 )e\ — x 2 x 3 e 2 — x 2 x 3 e 3 , where S is 
the upper half surface of the sphere xj + x 2 + x 3 = 1 and T is its boundary 
(a circle in the x 3 x 2 plane of radius 1 with its center at the origin). 

Find the area of the ellipse x t = a cosd,x 2 = b sin 0. 

Show that J s r x hda = 0, where A is a closed surface which encloses a 
volume V. 

Starting with Eq. (1.95), express A' v in terms of A fl . 

Show that velocity and acceleration are contravariant vectors and that the 
gradient of a scalar field is a covariant vector. 

The Cartesian components of the acceleration vector are 

d 2 x d 2 y d 2 z 

clx = W’ ° y = W ’ Gz = W 

Find the component of the acceleration vector in the spherical polar co- 
ordinates. 

1.35. Show that the property of symmetry (or anti-symmetry) with respect to 
indexes of a tensor is invariant under coordinate transformation. 

1.36. A covariant tensor has components xy, 2_y — z 2 ,xz in rectangular coordi- 
nates, find its covariant components in spherical coordinates. 



1.29. 

1.30. 

1.31. 

1.33. 

1.33. 

1.34. 
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1.37. Prove that a necessary condition that /= f'^ F(t, x, x)dt be an extremum 
(maximum or minimum) is that 

d / 0F\ dF _ 

dt \ dx J dx 

1.38. Show that the covariant derivatives of: (a) the metric tensor, and (b) the 
Kronecker delta are identically zero. 
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Ordinary differential equations 



Physicists have a variety of reasons for studying differential equations: almost all 
the elementary and numerous of the advanced parts of theoretical physics are 
posed mathematically in terms of differential equations. We devote three chapters 
to differential equations. This chapter will be limited to ordinary differential 
equations that are reducible to a linear form. Partial differential equations 
and special functions of mathematical physics will be dealt with in Chapters 10 
and 7. 

A differential equation is an equation that contains derivatives of an 
unknown function which expresses the relationship we seek. If there is only one 
independent variable and, as a consequence, total derivatives like dx/dt , the 
equation is called an ordinary differential equation (ODE). A partial differential 
equation (PDE) contains several independent variables and hence partial deriva- 
tives. 

The order of a differential equation is the order of the highest derivative appear- 
ing in the equation; its degree is the power of the derivative of highest order after 
the equation has been rationalized, that is, after fractional powers of all deriva- 
tives have been removed. Thus the equation 



d 2 y ffy 

dx 2 dx 



+ 2 y — 0 



is of second order and first degree, and 

\J 1 + ( dy/dx ) 3 

is of third order and second degree, since it contains the term (d 2 y/dx 3 ) 2 after it is 
rationalized. 
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A differential equation is said to be linear if each term in it is such that the 
dependent variable or its derivatives occur only once, and only to the first power. 
Thus 



is not linear, but 



A 

dx 3 



4 =° 



dx 



x . dv 

e sin x — + y = In x 
dx 



is linear. If in a linear differential equation there are no terms independent of y, 
the dependent variable, the equation is also said to be homogeneous ; this would 
have been true for the last equation above if the ‘In x’ term on the right hand side 
had been replaced by zero. 

A very important property of linear homogeneous equations is that, if we know 
two solutions v, and y 2 , we can construct others as linear combinations of them. 
This is known as the principle of superposition and will be proved later when we 
deal with such equations. 

Sometimes differential equations look unfamiliar. A trivial change of variables 
can reduce a seemingly impossible equation into one whose type is readily recog- 
nizable. 

Many differential equations are very difficult to solve. There are only a rela- 
tively small number of types of differential equation that can be solved in closed 
form. We start with equations of first order. A first-order differential equation can 
always be solved, although the solution may not always be expressible in terms of 
familiar functions. A solution (or integral) of a differential equation is the relation 
between the variables, not involving differential coefficients, which satisfies the 
differential equation. The solution of a differential equation of order n in general 
involves n arbitrary constants. 



First-order differential equations 

A differential equation of the general form 

% = > or g ( x . y) d y + f ( x > y) dx = o 

dx g(x,y) 

is clearly a first-order differential equation. 



( 2 . 1 ) 



Separable variables 

If f(x,y) and g(x,y) are reducible to P(x') and Q(y), respectively, then we have 

Q{y)dy + P(x)dx = 0. (2.2) 

Its solution is found at once by integrating. 



63 




ORDINARY DIFFERENTIAL EQUATIONS 



The reader may notice that dy/dx has been treated as if it were a ratio of dy and 
dx, that can be manipulated independently. Mathematicians may be unhappy 
about this treatment. But, if necessary, we can justify it by considering dy and 
dx to represent small finite changes by and Sx, before we have actually reached the 
limit where each becomes infinitesimal. 



Example 2.1 

Consider the differential equation 

dy/dx = —y 1 e x . 

We can rewrite it in the following form —dy/}’ 1 = e x dx which can be integrated 
separately giving the solution 

\/y = e x + c, 

where c is an integration constant. 

Sometimes when the variables are not separable a differential equation may be 
reduced to one in which they are separable by a change of variable. The general 
form of differential equation amenable to this approach is 

dy/dx = f(ax + by), (2.3) 

where /is an arbitrary function and a and b are constants. If we let w = ax + by, 
then bdy/dx = dw/dx — a, and the differential equation becomes 

dw/dx — a = bf(w) 

from which we obtain 



dw 

a + bf(w) 



dx 



in which the variables are separated. 



Example 2.2 
Solve the equation 

dy/dx = 8x + 4 y + (2x + y — 1)". 



Solution: Let w = 2x + y, then dy/dx = dw/dx — 2, and the differential equation 

becomes 

dw/dx + 2 = 4vr + ( w — l) 2 



or 

dw/[Aw + (■ w — 1 ) 2 — 2] = dx. 

The variables are separated and the equation can be solved. 
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A homogeneous differential equation which has the general form 

dy/dx =f(y/x) (2.4) 

may also be reduced, by a change of variable, to one with separable variables. 
This can be illustrated by the following example: 

Example 2.3 
Solve the equation 

dy y 2 + xy 
dx x 2 



Solution: The right hand side can be rewritten as {y/x) 2 + (y/x), and hence is a 

function of the single variable 

v = y/x. 



We thus use v both for simplifying the right hand side of our equation, and also 
for rewriting dy/dx in terms of v and x. Now 



dy 

dx 




dv 

v + x — 
dx 



and our equation becomes 



dv 2 

V + X — = V 

dx 



+ V 



from which we have 

dv dx 
v 2 x 

Integration gives 

— - = In x + c or x = Ae~ x ^ y , 
v 

where c and A (= e~ c ) are constants. 

Sometimes a nearly homogeneous differential equation can be reduced to 
homogeneous form which can then be solved by variable separation. This can 
be illustrated by the by the following: 

Example 2.4 
Solve the equation 

dy/dx = (y + x — 5)/(y — 3x — 1 ) . 
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Solution: Our equation would be homogeneous if it were not for the constants 

—5 and —1 in the numerator and denominator respectively. But we can eliminate 
them by a change of variable: 

x = x+ a, y'=y + /3, 



where a and (3 are constants specially chosen in order to make our equation 
homogeneous: 

dy' /dx' = (y' + x')/y' — 3x'. 



Note that dy' I dx' = dy/dx. Trivial algebra yields a 
v = y' /x' , then 



dy' 
dx ' 



d 

dx' 



(. XV ) = V + X 



dv 

dx' 



— —4. Now let 



and our equation becomes 



v + x 



, dv 
dx' 



v + 1 
v-3’ 



or 



v — 3 dx' 

— ? a 7 dv = — r 

— v 2 + Av+\ x 



in which the variables are separated and the equation can be solved by integra- 
tion. 



Example 2.5 
Fall of a skydiver. 



Solution: Assuming the parachute opens at the beginning of the fall, there are 

two forces acting on the parachute: the downward force of gravity mg, and the 
upward force of air resistance kv 2 . If we choose a coordinate system that has y = 0 
at the earth’s surface and increases upward, then the equation of motion of the 
falling diver, according to Newton’s second law, is 

mdv/dt = — mg + kv 2 , 



where m is the mass, g the gravitational acceleration, and k a positive constant. In 
general the air resistance is very complicated, but the power-law approximation is 
useful in many instances in which the velocity does not vary appreciably. 
Experiments show that for a subsonic velocity up to 300 m/s, the air resistance 
is approximately proportional to v 2 . 

The equation of motion is separable: 



mdv 

mg — kv 2 



= dt 
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or, to make the integration easier 



dv 



k 

= dt. 



— ( mg/k ) m 



Now 



1 



1 



1 ( 1 



- (mg/k) ( v + v,) ( v - v t ) 2v, \v - v, v + v, 



1 



where v 2 t = mg/k. Thus 



Integrating yields 



1 ( dv dv 



2v t \v — v t v — v, 



k 

= dt. 

m 



y-ln 

2v t \v + v r 



— 1 + c, 

m 



where c is an integration constant. 

Solving for v we hnally obtain 

(A = ^ + Sex P (- 2 £ ? /u)] 

V[> 1 — Bexp(—2gt/v t ) ’ 

where B = exp(2n r C). 

It is easy to see that as t — » oo, exp(— 2 gt/v,) — > 0, and so v — •> v t ; that is, if he 
falls from a sufficient height, the diver will eventually reach a constant velocity 
given by v h the terminal velocity. To determine the constants of integration, we 
need to know the value of k, which is about 30 kg/m for the earth’s atmosphere 
and a standard parachute. 



Exact equations 

We may integrate Eq. (2.1) directly if its left hand side is the differential du of 
some function u(x,y), in which case the solution is of the form 

u(x,y) = C (2.5) 

and Eq. (2.1) is said to be exact. A convenient test to see if Eq. (2.1) is exact is 
does 

dg(x,y) _ df(x,y) 

Ox dy ' 1 ’ 

To see this, let us go back to Eq. (2.5) and we have 

d[u(x, y)\ = 0. 

On performing the differentiation we obtain 

du , du , 

—dx + —dy = 0. (2.7) 



67 




ORDINARY DIFFERENTIAL EQUATIONS 



It is a general property of partial derivatives of any well-behaved function that 
the order of differentiation is immaterial. Thus we have 



d / du\ d / du\ 
dy \ dx ) dx \dy J 



( 2 . 8 ) 



Now if our differential equation (2.1) is of the form of Eq. (2.7), we must be able 
to identify 

f(x,y) = du/dx and g(x,y) = du/dy. (2.9) 



Then it follows from Eq. (2.8) that 

dg(x,y) = df(x,y) 
dx dy 

which is Eq. (2.6). 



Example 2.6 

Show that the equation xdy/dx + (x + y) = 0 is exact and find its general solu- 
tion. 



Solution: We first write the equation in standard form 

(x + y)dx + xdy = 0. 

Applying the test of Eq. (2.6) we notice that 

dfd dg dx 

6y = ey {x + y) = l a " d = dx =L 

Therefore the equation is exact, and the solution is of the form indicated by Eq. 
(2.7). From Eq. (2.9) we have 

du/dx = x + y, du/dy = x , 

from which it follows that 

u(x,y) = x 2 / 2 + xy + h(y), u(x,y) = xy + k(x), 

where h(y) and k(x') arise from integrating u(x,y) with respect to x and y, respec- 
tively. For consistency, we require that 

h(y) = 0 and k(x) = x 2 / 2. 



Thus the required solution is 



x 2 / 2 + xy = c. 



It is interesting to consider a differential equation of the type 

dy 



g(x, y)j x +f C x , y ) = k{x ) , 



( 2 . 10 ) 
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where the left hand side is an exact differential (d/dx)\u{x,y)\, and k(x) on the 
right hand side is a function of x only. Then the solution of the differential 
equation can be written as 

u(x,y) = j k{x)dx. (2-11) 

Alternatively Eq. (2.10) can be rewritten as 

g(x,y) ^ + \f(x,y) - k(x)\ = 0. (2.10a) 

Since the left hand side of Eq. (2.10) is exact, we have 

dg/dx = df /dy. 

Then Eq. (2.10a) is exact as well. To see why, let us apply the test for exactness for 
Eq. (2.10a) which requires 

^ [#(■*> t)] = ^ I/Cr j) - *(*)] = ^ [f(x,y)\- 

Thus Eq. (2.10a) satisfies the necessary requirement for being exact. We can thus 
write its solution as 

U(x,y) = c, 

where 

^7 = g(x, y) and ^ = f(x, y) - k(x) . 

Of course, the solution U(x,y ) = c must agree with Eq. (2.11). 



Integrating factors 

If a differential equation in the form of Eq. (2.1) is not already exact, it sometimes 
can be made so by multiplying by a suitable factor, called an integrating factor. 
Although an integrating factor always exists for each equation in the form of Eq. 
(2.1), it may be difficult to find it. However, if the equation is linear, that is, if can 
be written 

C ^ x +f{x)y = g{x) (2.12) 

an integrating factor of the form 

exp f(x)dx ^ (2-13) 

is always available. It is easy to verify this. Suppose that R(x) is the integrating 
factor we are looking for. Multiplying Eq. (2.12) by R , we have 

R-j- + Rf(x)y = Rg(x), or Rdy + Rf(x)ydx = Rg(x)dx. 
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The right hand side is already integrable; the condition that the left hand side of 
Eq. (2.12) be exact gives 






dR 

dx ’ 



which yields 

clR/dx = Rf(x ), or dR/ R = f(x)dx , 
and integrating gives 

lni? = J f(x)dx 

from which we obtain the integrating factor R we were looking for 

R = exp (^ j f(x)dx'j . 



It is now possible to write the general solution of Eq. (2.12). On applying the 
integrating factor, Eq. (2.12) becomes 



d(ye F ) 

dx 



g(x)e F , 



where F(x) = Jf(x)dx. The solution is clearly given by 



y = e 



J e F g(x)dx + C 



Example 2.7 

Show that the equation xdy/dx + 2y + x 2 = 0 is not exact; then find a suitable 
integrating factor that makes the equation exact. What is the solution of this 
equation? 



Solution: We first write the equation in the standard form 

(2y + x 2 )dx + xdy = 0; 

then we notice that 



S- (2v + x 2 ) = 2 and x=l , 
ay ' ox 



which indicates that our equation is not exact. To find the required integrating 
factor that makes our equation exact, we rewrite our equation in the form of Eq. 
( 2 . 12 ): 



dy 

dx 



2 l=-x 
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from which we find / (x) = 1 jx, and so the required integrating factor is 

exp (/(l A*)*) =rap(ln*) = *. 

Applying this to our equation gives 

x 2 — + 2 xy + x 3 = 0 or — (x 2 v + x 4 / 4) = 0 
ax ' ax ' 

which integrates to 

x 2 y + x 4 /4 = c, 



or 



y = 



c — X 
4x 2 



Example 2.8 

RL circuits: A typical RL circuit is shown in Fig. 2.1. Find the current I(t) in the 
circuit as a function of time t. 



Solution: We need first to establish the differential equation for the current 

flowing in the circuit. The resistance R and the inductance L are both constant. 
The voltage drop across the resistance is IR, and the voltage drop across the 
inductance is Ldl /dt. Kirchhoff’s second law for circuits then gives 

L d ^ + RI(t)=E(t), 

which is in the form of Eq. (2. 12), but with t as the independent variable instead of 
x and / as the dependent variable instead of y. Thus we immediately have the 
general solution 

7(0 = \f RtlL J e R,/L E(t)dt + ke~ R,/L , 




Figure 2.1. RL circuit. 
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where k is a constant of integration (in electric circuits, C is used for capacitance). 
Given E this equation can be solved for 7(7). If the voltage E is constant, we 
obtain 



I {t)= }. e -^ L 



r — e~ R, ! L 

R 



ke -RtlL = ^ + ke -Rt/L 

R 



Regardless of the value of k, we see that 

I(t)-+E/R as t — > oo. 
Setting t = 0 in the solution, we find 

k = 7(0) - E/R. 



Bernoulli’s equation 

Bernoulli’s equation is a non-linear first-order equation that occurs occasionally 
in physical problems: 

dy 



dx 



-f(x)y = s(x)y", 



(2.14) 



where n is not necessarily integer. 

This equation can be made linear by the substitution w = y a with a suitably 
chosen. We find this can be achieved if a = 1 — n: 

w = y 1- " or y — 

This converts Bernoulli’s equation into 

^ + (1 - n)f{x)w = (1 - n)g(x), 

which can be made exact using the integrating factor exp(f(l — n)f(x)dx). 



Second-order equations with constant coefficients 

The general form of the nth-order linear differential equation with constant coef- 
ficients is 



d"y 

dx" 



+ P l 



d" 1 v dv 

dx" P " ' dx ' P " y 



(7)" + p\D" 1 + • • • +p n _\D + p n )y —f(x), 



where P\,P 2 ,-- - are constants, /(x) is some function of x, and D = d/dx. If 
f(x) = 0, the equation is called homogeneous; otherwise it is called a non-homo- 
geneous equation. It is important to note that the symbol D is meaningless unless 
applied to a function of x and is therefore not a mathematical quantity in the 
usual sense. D is an operator. 
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Many of the differential equations of this type which arise in physical problems 
are of second order and we shall consider in detail the solution of the equation 

-jp- + a-^ + by = (. D 2 + aD + b)y =f(t), (2-15) 

where a and b are constants, and t is the independent variable. As an example, the 
equation of motion for a mass on a spring is of the form Eq. (2.15), with a 
representing the friction, c being the constant of proportionality in Hooke’s law 
for the spring, and /(?) some time-dependent external force acting on the mass. 
Eq. (2.15) can also apply to an electric circuit consisting of an inductor, a resistor, 
a capacitor and a varying external voltage. 

The solution of Eq. (2.15) involves first finding the solution of the equation with 
f(t) replaced by zero, that is, 

a-y- + by = (D 2 + aD + b)y = 0; (2-16) 

at at 

this is called the reduced or homogeneous equation corresponding to Eq. (2.15). 



Nature of the solution of linear equations 

We now establish some results for linear equations in general. For simplicity, we 
consider the second-order reduced equation (2.16). If y x and y 2 are independent 
solutions of (2.16) and A and B are any constants, then 

D ( A . 1 ' | By 2 ) = ADi’i + BDy 2 , D (Ay [ + By 2 ) = h B y 1 + BD y 2 

and hence 

( D 2 + aD + b)(Ay\ + By 2 ) = A(D 2 + aD + b)y\ + B(D 2 + aD + b)y 2 = 0. 

Thus y = Ay i + By 2 is a solution of Eq. (2.16), and since it contains two arbitrary 
constants, it is the general solution. A necessary and sufficient condition for two 
solutions ji and y 2 to be linearly independent is that the Wronskian determinant 
of these functions does not vanish: 



y 1 


J2 


d\'\ 


dyi 


dt 


dt 



Similarly, if • • • , y„ are n linearly independent solutions of the nth-order 
linear equations, then the general solution is 

y = ^lTi + A 2 y 2 + • • • + A n y n , 

where A U A 2 , . . . ,A n are arbitrary constants. This is known as the superposition 
principle. 
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General solutions of the second-order equations 



Suppose that we can find one solution, y p (t) say, of Eq. (2.15): 

(D + aD + b)y p (t) = f(t). (2.15a) 

Then on defining 

y c {t) = y{t) -y P {t) 



we find by subtracting Eq. (2.15a) from Eq. (2.15) that 

(D + uD + b)y c (t) = 0. 

That is, y c (t) satisfies the corresponding homogeneous equation (2.16), and it is 
known as the complementary function y c (t) of non-homogeneous equation (2.15). 
while the solution y p (t) is called a particular integral of Eq. (2.15). Thus, the 
general solution of Eq. (2.15) is given by 

y{t) = Ay c {t) + By p {t). (2.17) 



Finding the complementary function 

Clearly the complementary function is independent of f(t), and hence has nothing 
to do with the behavior of the system in response to the external applied influence. 
What it does represent is the free motion of the system. Thus, for example, even 
without external forces applied, a spring can oscillate, because of any initial 
displacement and/or velocity. Similarly, had a capacitor already been charged 
at t = 0, the circuit would subsequently display current oscillations even if there 
is no applied voltage. 

In order to solve Eq. (2.16) for y c {t), we first consider the linear first-order 
equation 

4 + *r.«. 

dt 

Separating the variables and integrating, we obtain 

y = Ae ~ b,la , 

where A is an arbitrary constant of integration. This solution suggests that Eq. 
(2.16) might be satisfied by an expression of the type 

y = <?‘, 

where p is a constant. Putting this into Eq. (2.16), we have 

e pt (p 2 + ap + b) = 0. 

Therefore y = e pt is a solution of Eq. (2.16) if 

p 2 + ap J rb = 0. 
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